29n102531.doc

advertisement
INTERNATIONAL ORGANISATION FOR STANDARDISATION
ORGANISATION INTERNATIONALE DE NORMALISATION
ISO/IEC JTC 1/SC 29/WG 11
CODING OF MOVING PICTURES AND AUDIO
ISO/IEC JTC 1/SC 29/WG 11
N10314
Lausanne, CH – February 2008
Source: Leonardo Chiariglione
Title:
Report of 87th meeting
Status
Report of 87th meeting .......................................................................................................................... 1
Annex A – Attendance list .................................................................................................................. 18
Annex B – Agenda .............................................................................................................................. 23
Annex C – Input contributions ............................................................................................................ 26
Annex D – Output documents............................................................................................................. 46
Annex E – Requirements report .......................................................................................................... 54
Annex F – Systems report ................................................................................................................... 58
Annex G – Video report ...................................................................................................................... 97
Annex I – Audio report ..................................................................................................................... 115
Annex J – 3DG report ....................................................................................................................... 148
Report of 87th meeting
1
Opening
The 87th MPEG Meeting was held from 2nd to 6th February 2009 at Ecole Polytechnique Fédérale
de Lausanne (EPFL), Lausanne, Switzerland.
2
Roll call of participants
The attendance list is given in Annex 1.
3
Approval of agenda
The agenda is given in Annex 2.
4
Allocation of contributions
The input contribution are listed in Annex 3.
5
Communications from Convenor
There was no specific communication.
1
6
Report of previous meeting
This was approved
7
Processing of NB Position Papers
Input documents from National Bodies were presented, discussed and a response provided, as
appropriate.
8
Work plan management
8.1 Media coding
8.1.1
HD-AAC Profile
The following document was approved
10385
8.1.2
ISO/IEC 14496-3:2009/FPDAM 1:200X HD-AAC Profile
960 frame length in MPEG-4 AAC
The following document was approved
10434
8.1.3
Issues concerning frame lengths in the AAC family profiles
Constrained Baseline Profile
The following documents were approved
10341
10342
8.1.4
Disposition of Comments on ISO/IEC 14496-10:200X/PDAM 1
Text of ISO/IEC 14496-10:200X/FPDAM 1 Constrained Baseline Profile and
supplemental enhancement information
Multiview Field High Profile
The following document was approved
10344
8.1.5
Working Draft 1 of ISO/IEC 14496-10:200X/Amd.2 Multiview Field High Profile
AFX 3rd edition
The following document was approved
10331
8.1.6
WD of ISO/IEC 14496-16 3rd Edition
Multiresolution profile
The following documents were approved
10329
10330
Text of ISO/IEC 14496-16:2006/PDAM4 (Scalable Complexity 3D Mesh
Compression)
CE on Scalable Complexity 3D Mesh Compression
2
8.1.7
Open Font Format extensions
The following documents were approved
10450
10451
8.1.8
DoC on ISO/IEC FCD 14496-22 2nd Edition
Text of ISO/IEC FDIS 14496-22 2nd Edition
Media Value Chain Ontology
The following documents were approved
10454
10455
8.1.9
Draft DoC on ISO/IEC CD 21000-19 Media Value Chain Ontology
Draft Text of ISO/IEC FCD 21000-19 Media Value Chain Ontology
Codec Configuration Representation
The following documents were approved
10348
10349
Disposition of Comments on ISO/IEC FCD 23001-4
Text of ISO/IEC FDIS 23001-4 Codec Configuration Representation
8.1.10 Video Tool Library
The following documents were approved
10350
10351
10352
10354
10355
10356
Disposition of Comments on ISO/IEC FCD 23002-4
Text of ISO/IEC FDIS 23002-4 Video Tool Library
Request for ISO/IEC 23002-4/Amd.1
WD 4 of ISO/IEC 23002-4/Amd.2 (Tools for MPEG-2 MP, MPEG-4 ASP, AVC HP
and SVC)
RVC Work Plan and FU Development Status
Description of Core Experiments in RVC
8.1.11 MPEG Surround
The following document was approved
10386
Thoughts on MPEG Surround Signaling
8.1.12 Spatial Audio Object Coding
The following documents were approved
10416
10417
Study on ISO/IEC FCD 23003-2:200x, Spatial Audio Object Coding
Status and Workplan on SAOC Core Experiments
8.1.13 Unified Speech and Audio Coding
The following documents were approved
10418
10419
WD2 of USAC
Workplan for USAC CEs
3
10420
10421
10422
10423
MPEG Reference Encoder and the Audio CE Process
Workplan on MPEG Reference Encoder
Draft Revisions to MPEG Audio CE methodology
Thoughts on Efficient Bitstream Syntax
8.1.14 Interfaces with virtual worlds
The following documents were approved
10498
10474
10475
10476
10477
Requirements for MPEG-V Version 3.2
WD of Architecture
WD of Sensory Information
WD of Avatar Information
WD of Control Information
8.1.15 3D Video Coding
The following documents were approved
10357
10358
10359
10360
Vision on 3D Video Coding
Applications and Requirements of 3D Video Coding
Call for 3D Test Material: Depth Maps & Supplementary Information
Description of Exploration Experiments in 3D Video Coding
8.1.16 High-Performance Video Coding
The following documents were approved
10361
10362
10363
Vision and Requirements for High-Performance Video Coding (HVC)
Call for Test Materials for High-Performance Video Coding Standardisation
Draft Call for Evidence on High-Performance Video Coding
8.2 Composition coding
8.2.1
General
The following document was approved
10449
8.2.2
Clarification on the usage of ISO/IEC 14496-20 by other standardization bodies
Interactive Digital Radio
The following documents were approved
10503
10439
8.2.3
Requirements v2.0 for a new BIFS profile to support Interactive Digital Radio
WD 1.0 of ISO/IEC 14496-11:2002/AMD 7 New BIFS profile
LASeR Adaptation
The following documents were approved
4
10446
10447
10448
8.2.4
DoC on ISO/EC 14496-20:2008/PDAM 2 Adaptation
Text of ISO/EC 14496-20:2008/FPDAM 2 Adaptation
Workplan for service example of LASeR Adaptation & PSI
Presentation of Structured Information
The following document was approved
10453
WD2.0 of ISO/IEC 21000-2 AMD PSI
8.3 Description coding
8.3.1
Video Signature Tools
The following document was approved
10345
8.3.2
Description of Core Experiments in Video Signature Description development
Audio description coding standards
The following documents were approved
10399
10413
DoC on ISO/IEC TR 15938-8:2002/PDAM 4, Extraction of audio features from
compressed formats
ISO/IEC TR 15938-8:2002/DAM 4, Extraction of audio features from compressed
formats
8.4 Transport and File formats
8.4.1
Carriage of MVC in MPEG-2 Systems
The following documents were approved
10435
10436
8.4.2
DoC on ISO/IEC 13818-1:2007/PDAM4 Transport of MVC
Text of ISO/IEC 13818-1:2007/FPDAM4 Transport of MVC
Miscellaneous additions to File Format
The following document was approved
10442
8.4.3
Study of ISO/IEC 14496-12:200X/FPDAM 1 General Improvements
Handling of MPEG-4 Audio enhancement layers
The following documents were approved
10500
10501
Request for ISO/IEC 14496-14:2003/PDAM1 Handling of MPEG-4 Audio
enhancement layers
Text of ISO/IEC 14496-14:2003/PDAM1 Handling of MPEG-4 Audio enhancement
layers
5
8.4.4
AVC File Format extensions for MVC
The following documents were approved
10444
10445
8.4.5
DoC on ISO/IEC 14496-15:2004/PDAM 3 MVC File Format
Text of ISO/IEC 14496-15:2004/FPDAM 3 MVC File Format
Modern MPEG Transport
The following document was approved
10496
MPEG Modern Transport (MMT) over Networks
8.5 Multimedia architecture
8.5.1
MXM general
The following document was approved
10469
Proposal for new work item
8.5.2
MXM Architecture and Technologies
The following document was approved
10470
8.5.3
Text of ISO/IEC CD 23006-1 MXM Architecture and Technologies
MXM API
The following document was approved
10471
8.5.4
Text of ISO/IEC CD 23006-1 MXM APIs
Advanced IPTV Terminal
The following documents were approved
10497
10478
10479
Draft Advanced IPTV Terminal (AIT) Requirements
Ideas on the new AIT project
Ideas on How to Implement Collaboration Between MPEG and ITU-T Q.13/SG16
on the Advanced IPTV Terminal Standardisation
8.6 Application formats
8.6.1
DMB AF Harmonization of MPEG-2 TS storage
The following documents were approved
10461
10462
Request for ISO/IEC 23000-9:2008/PDAM2 DMB AF Harmonization of MPEG-2
TS storage
Text of ISO/IEC 23000-9:2008/PDAM2 DMB AF Harmonization of MPEG-2 TS
storage
6
8.6.2
Interactive Music Application Format
The following document was approved
10468
Text of ISO/IEC CD 23000-12 Interactive Music AF
8.7 Protocols
8.7.1
MXM Protocols
The following document was approved
10473
Text of ISO/IEC CD 29116-1 2nd edition MXM Protocols
8.8 Reference implementation
8.8.1
MVC Reference Software
The following documents were approved
10339
10340
8.8.2
Disposition of Comments on ISO/IEC 14496-5:2001/PDAM 15
Text of ISO/IEC 14496-5:2001/FPDAM 15 Reference Software for Multiview Video
Coding
3D Graphics Compression Model Reference Software
The following documents were approved
10323
10324
8.8.3
DOCR on ISO/IEC 14496-5:2001/FPDAM 22 (3DGCM Reference Software)
Text of ISO/IEC 14496-5:2001/FDAM 22 (3DGCM Reference Software)
SC3DMC Reference Software
The following documents were approved
10326
10327
8.8.4
Request for Amendment: 14496-5:2001/PDAM27
Text of ISO/IEC 14496-5:2001/PDAM27 (SC3DMC RefSoft)
Scene Partitioning Reference Software
The following document was approved
10325
8.8.5
Text ISO/IEC 14496-5:2001/FPDAM 25 (Scene Partitioning Reference Software)
Professional Archival MAF Reference Software
The following documents were approved
10456
10457
10458
DoC on ISO/IEC 23000-6 PA-AF/PDAM 1 Conformance and Reference Software
Text of ISO/IEC 23000-6 PA-AF/FPDAM 1 Conformance and Reference Software
Workplan for ISO/IEC 23000-6 PA-AF Conformance and Reference Software
7
8.8.6
DMB Application Format
The following documents were approved
10459
10460
8.8.7
Text of ISO/IEC 23000-9:2008/FPDAM1 DMB AF Conf. And Ref. Soft.
Workplan for DMB AF Conf. And Ref. Soft.
Video Surveillance AF Reference Software
The following documents were approved
10463
10464
8.8.8
DoC on ISO/IEC 23000-10 PDAM1 Video Surveillance Application Format Conf. &
Ref. SW.
Text of ISO/IEC 23000-10 FPDAM1 Video Surveillance Application Format Cof. &
Ref. SW.
Stereoscopic video AF Reference Software
The following document was approved
10466
8.8.9
WD 1.0 of ISO/IEC 23000-11/AMD1 Stereoscopic Video Application Format Conf.
& Ref. SW.
Video Tool Library Reference Software
The following document was approved
10353
Text of ISO/IEC 23002-4/PDAM1 Video Tool Library Conformance and Reference
Software
8.8.10 MXM Reference Software
The following documents were approved
10507
10472
List of identified non MPEG members to be allowed to access MPEG SVN
repository
Text of ISO/IEC CD 23006-1 MXM Conf. & Ref. SW
8.9 Conformance
8.9.1
MPEG-4 Audio Conformance
The following documents were approved
10394
10395
10397
10398
Request for Subdivision of 14496, Audio Conformance
ISO/IEC 14496-26:2009, Audio Conformance
ISO/IEC 14496-26:2009/FPDAM 1, AAC-ELD, OAFI, additional AAC and MPEG1/2 on MPEG-4 Conformance
WD on additional BSAC conformance streams for broadcasting
8
8.9.2
AAC-ELD, OAFI and additional AAC Conformance
The following document was approved
10391
8.9.3
DoC on ISO/IEC 14496-4:2004/PDAM 36, AAC-ELD, OAFI and additional AAC
Conformance
MVC Conformance
The following documents were approved
10337
10338
8.9.4
Disposition of Comments on ISO/IEC 14496-4:2004/PDAM 38
Text of ISO/IEC 14496-4:2004/FPDAM 38 Multiview Video Coding Conformance
Testing
File Format Conformance
The following document was approved
10438
8.9.5
Text of ISO/IEC 14496-4:2004/FPDAM 37 File Format Conformance
Improvements
3DG conformance
The following documents were approved
10332
10333
10334
10433
10335
8.9.6
Request for subdivision of ISO/IEC 14496-27
Text of ISO/IEC 14496-27:2009/FDIS (3DG Conformance)
Text of ISO/IEC 14496-27:2009/FPDAM1 (Scene partitioning conformance)
Request for Amendment: 14496-27:2009/PDAM2 (SC3DMC Conformance)
Text of ISO/IEC 14496-27:2009/PDAM2 (SC3DMC Conformance)
MultiResolution Profile Conformance
The following document was approved
10320
8.9.7
DOCR on ISO/IEC 14496-4:2004/FPDAM 33 (Multi Resolution Profile
Conformance)
3D Graphics Compression Model Conformance
The following document was approved
10321
8.9.8
DOCR on ISO/IEC 14496-4:2004/FPDAM 34 (3D Graphics Model Conformance)
3D Graphics Conformance
The following document was approved
10322
Text of ISO/IEC 14496-4:200x/DCOR 8 (Removal of 3DG Conformance)
9
8.9.9
Photo Player MAF Conformance
The following documents were approved
10346
10347
Request for ISO/IEC 23000-3/Amd.2
Text of ISO/IEC 23000-3/PDAM2 Conformance Testing for Photo Player MAF
8.9.10 Professional Archival MAF Conformance
The following documents were approved
10456
10457
10458
DoC on ISO/IEC 23000-6 PA-AF/PDAM 1 Conformance and Reference Software
Text of ISO/IEC 23000-6 PA-AF/FPDAM 1 Conformance and Reference Software
Workplan for ISO/IEC 23000-6 PA-AF Conformance and Reference Software
8.9.11 DMB Application Format
The following documents were approved
10459
10460
Text of ISO/IEC 23000-9:2008/FPDAM1 DMB AF Conf. And Ref. Soft.
Workplan for DMB AF Conf. And Ref. Soft.
8.9.12 Video Surveillance AF Conformance
The following documents were approved
10463
10464
DoC on ISO/IEC 23000-10 PDAM1 Video Surveillance Application Format Conf. &
Ref. SW.
Text of ISO/IEC 23000-10 FPDAM1 Video Surveillance Application Format Cof. &
Ref. SW.
8.9.13 Stereoscopic video AF Conformance
The following document was approved
10466
WD 1.0 of ISO/IEC 23000-11/AMD1 Stereoscopic Video Application Format Conf.
& Ref. SW.
8.9.14 Video Tool Library Conformance
The following document was approved
10353
Text of ISO/IEC 23002-4/PDAM1 Video Tool Library Conformance and Reference
Software
8.9.15 MXM Conformance
The following document was approved
10472
Text of ISO/IEC CD 23006-1 MXM Conf. & Ref. SW
10
8.10 Maintenance
8.10.1 Systems coding standards
The following documents were approved
10437
10440
10441
10499
10443
WD 1.0 of ISO/IEC 13818-1:2007 DCOR X
DoC on ISO/IEC 14496-12:200X/DCOR 2 Usage of brands and box order in sample
entry
Text of ISO/IEC 14496-12:200X/COR 2 Usage of brands and box order in sample
entry
Text of ISO/IEC 14496-12:2003/DCOR 3
Text of ISO/IEC 14496-15:2004/COR3
8.10.2 Video coding standards
The following document was approved
10343
Defect Report on ISO/IEC 14496-10:200X
8.10.3 Audio coding standards
The following documents were approved
10373
10374
10375
10376
10377
10378
10379
10380
10381
10382
10383
10384
10387
10388
10389
10390
10392
10393
10396
10414
10415
DoC on ISO/IEC 13818-4:2004/AMD 2:2005/DCOR 2, AAC Conformance
ISO/IEC 13818-4:2004/AMD 2:2005/Cor 2, AAC Conformance
DoC on ISO/IEC 13818-7:2006/DCOR 1, AAC
ISO/IEC 13818-7:2006/Cor. 1, AAC
DoC on ISO/IEC 14496-3:2005/DCOR. 6, AAC
ISO/IEC 14496-3:2005/Cor. 6, AAC
DoC on ISO/IEC 14496-3:2005/AMD 2:2006/DCOR 4, HE-AAC V2 Profile and
ALS
ISO/IEC 14496-3:2005/AMD 2:2006/Cor. 4, HE-AAC V2 Profile and ALS
DoC on ISO/IEC 14496-3:2005/AMD 3:2006/ DCOR 2, SLS
ISO/IEC 14496-3:2005/AMD 3:2006/Cor. 2, SLS
DoC on ISO/IEC 14496-3:2005/AMD 9:2008/DCor. 1, AAC-ELD
ISO/IEC 14496-3:2005/AMD 9:2008/Cor. 1, AAC-ELD
ISO/IEC 14496-4:2004/Cor. 6, AAC-LD
ISO/IEC 14496-4:2004/DCOR 7, Removal of Audio Conformance
DoC on ISO/IEC 14496-4:2004/AMD13:200x/DCOR 2, AAC-LD bitstreams
ISO/IEC 14496-4:2004/AMD13:200x/Cor. 2, AAC-LD bitstreams
DoC on ISO/IEC 14496-5:2001/Amd.10:2007/DCOR 3, ALS and SLS
ISO/IEC 14496-5:2001/Amd.10:2007/COR 3, ALS and SLS
ISO/IEC 14496-26:2009/DCOR 1, ALS and SLS updates
ISO/IEC 23003-1:2007/DCOR 2, Misc. Corrections
ISO/IEC 23003-1:2007/AMD 2:2008/DCOR 1, Ref. Sw. Update
8.10.4 Other MPEG-7 standards
The following document was approved
11
10452
Text of ISO/IEC 15938-12:2008 /COR 1
8.10.5 MPEG-A standards
The following document was approved
10467
Text of ISO/IEC 23000-11/DCOR1 (SVAF signalling of voice codecs)
8.11 Work plan and time line
The following documents were approved
10401
10402
10403
9
MPEG Standards
Table of unpublished FDISs
Work plan and time line
Organisation of this meeting
9.1 Tasks for subgroups
The following tasks were assigned
Requirements Std
4
A
V
Systems
Std
2
4
Pt Amd
10
11
Y
10
Pt Amd
1 4
4
11
12
15
20
7
22
12
37
23
?
?
1
Cor.2
3
2
3
2nd Ed
Cor1
MVC Profile
Laser-BIFS integration
User Interface framework
Advanced Video Surveillance
Responses to CfP for Interfaces with virtual worlds
Loudness metadata
Advanced IPTV Terminal
Contribution to press release

HVC - CfE
3DV – Vision, Applications, Requirements
New standard areas
 Audio
Carriage of MVC
RA
FF conformance
Synthesised texture RS
SVC FF RS
Laser-BIFS integration
Miscellanea
Usage of brands etc.
MVC File Format
Adaptation technologies for Laser
Presentation of Structured Information
Open Font Format
12
21
A
B
M
19
4 1
2
5 2nd Ed
6 1
9 Cor 1
9 1
9 2
10 ?
1
11 Cor1
1
12
2 1
1
2
3
V
MVCO
Musical Slide Show MAF RS & C
Protected Musical Slide Show MAF RS & C
Media Streaming MAF
Professional Archival AF RS & C
DMB MAF
DMB MAF RS & C
DMB MPEG-2 TS storage
Video Surveillance AF
Video Surveillance AF RS & C
Stereoscopic video AF
Stereoscopic video AF RS & C
Interactive music AF
Fragment Request Unit RS & C
MXM Architecture
API
RS & C
Information exchange with virtual worlds
Representation of sensory effects information
Advanced IPTV Terminal
Update MPEG technology web page
 MAFs

Contribution to press release
 AIT?
 MXM
 Interactive music AF
Video
7
A
B
C
JVT
4
3 4
6
7
8
3 2
4
4
4 1
2
4 38
15
10 Cor 1
1
Video Signature Tools
Image Signature Tools RS
Image Signature Tools C
Image Signature Tools Matching and feature extraction
Photo Player Conformance
Codec Configuration Description
Video Tool Library
Video Tool Library Conformance & RS
Video Tool Library extensions
3DV/FTV
HVC
Update MPEG technology web page
Contribution to press release
 RVC
 HVC CfE
 Emmy
MVC Conformance
MVC RS
Miscellanea
AVC Constrained Baseline Profile
13
Audio
4
D
4E
1
4 36
5 24
26
2
3
Description of work items
MPEG-4 Audio
SLS profile
AAC-ELD conformance
AAC-ELD Reference Software
Conformance
Spatial Audio Object Coding
USAC
New audio issues (HVC)
Contribution to press release
 MPEG reference encoder
3DG
4
27
1
5 22
25
27
16 3rd Ed
4
V
3DG Conformance
3D Graphics Compression model Conformance
3D Graphics Compression model Reference Software
Scene partitioning RS
Scalability complexity 3DMC RS
Scalable complexity 3DMC
Information exchange with virtual worlds
Contribution to press release

9.2 Joint meetings
The following joint meetings were held
Groups
Sys, 3dg
Sys, 3dg
Sys, aud
Sys, req
Sys, req
Vid, req
3dg, vid
Aud, req
What
Day
MXM
Tue
MPEG-V
Tue
FF
Wed
AIT
Wed
BIFS
Thu
Interl. MVC, HVC, 3DV Wed
RVC
Wed
Audio for HVC
Thu
Time
14:00-15:00
15:00-16:00
14:00-15:00
12:00-13:00
14:00-15:00
15:30-17:30
14:00-15:00
15:00-15:30
Where
3dg
3dg
aud
sys
sys
vid
3dg
aud
10 WG management
10.1 WG organisation
10.2 Terms of reference
Recently the amount of activities handled by JVT group has been constantly reducing and, as a
consequence, its attendance has also been reducing. Because the JVT is a joint group with ITU-T SG
16 the JVT entails a significant organisational budget. For a group like MPEG that operates on a
voluntary basis continuing the JVT must be balanced by significant benefits from its existence.
14
WG11 has decided to discontinue its support for continuing the JVT. This is reflected in the
following document approved at the meeting.
10400
Terms of reference
10.3 Editors
The following document was approved
10404
Editors of MPEG standards
10.4 Liaisons
The following liaison statements were issued
10319
10482
10483
10484
10485
10486
10487
10488
10489
10490
10491
10492
10493
10494
10495
10502
10504
10505
10506
10364
10366
10424
10425
10426
10427
10328
Liaison statement to ITU-T SG 16
Liaison statement to W3C on MXM
Liaison statement to WG 1 on PA-AF
Liaison statement to ITU-T SG16 on IPTV
Liaison statement to OMA BCAST on ISO/IEC 14496-20
Liaison statement to IEC TC 100 on HD Recorder/Receiver Interface
Liaison statement to IEC TC 100 on IP & TS based service access
Liaison statement to IEC TC 100 on digital right permission code
Liaison statement to JTC 1 Study Group on Sensor Network
Liaison statement to ISO TC 223 on Video Surveillance
Liaison statement to FNB on Informal workshop on Video Surveillance
Liaison statement to SC27 on new work item regarding digital evidence
Liaison statement to EDItEUR on MVCO
Liaison statement to IPFI on MVCO
Liaison statement to DOI on MVCO
Liaison statement to SMPTE 23B/Container on package formats
Liaison statement to WorldDMB on new BIFS profile
Liaison statement to GRN on new BIFS profile
Liaison statement to TTA on new BIFS profile
Liaison statement to ITU-R SG6 re Multi-view Video Coding
Liaison statement to ITU-T SG16 re interlaced Multi-view Video Coding
Response to DRM on MPEG-4 AAC Technology and Profiles
Response to ETSI/EBU/CENELEC JTC on MPEG-4 AAC Technology and Profiles
Response to WorldDMB Forum on MPEG-4 AAC Technology and Profiles
Response to IEC TC100/TA4 on IEC CDV 61937-11 and 60958-3/Amd.1
Liaison statement to SC 24
The following document was approved
10411
List of Organisations with which MPEG entertains liaisons
10.5 Ad hoc groups
The following ad hoc groups were established
15
10370
10336
10514
10510
10431
10372
10513
10371
10367
10509
10369
10512
10515
10368
10432
10508
AHG on 3D Video Coding
AHG on 3DGC documents, software maintenance and core experiments
AHG on Advanced IPTV Terminal
AHG on Application Format
AHG on Audio Standards Maintenance
AHG on AVC Development
AHG on Font Format Representation
AHG on High-Performance Video Coding
AHG on Maintenance of MPEG-4 Visual related Documents, Reference Software and
Conformance
AHG on MPEG File Formats
AHG on MPEG-7 Visual
AHG on MPEG-V (including previous RoSE activities)
AHG on MXM
AHG on Reconfigurable Video Coding
AHG on SAOC, USAC and MetaData
AHG on Scene Representation
10.6 Asset management
The following documents were approved
10405
10406
10407
10408
10409
Schema assets
Software assets
Conformance assets
Content assets
URI assets
10.7 IPR management
The following document was approved
10410
Standards under development for which a call for patent statements is issued
11 Administrative matters
11.1 Responses to National Bodies
The following responses to national Bodies were approved
10365
10428
10429
10430
Responses to National Bodies
Response to AUNB Comments on USAC
Response to AUNB Comments on MetaData
Response to FR, FI and CN NB Comments on USAC
11.2 Schedule of future MPEG meetings
The following schedule was approved
16
#
87
88
89
90
91
92
93
94
City
Country yy mm
Lausanne
CH
09 02
Maui, HI
US
09 04
London
UK
09 06-07
Xian
CN
09 10
Kyoto
JP
10 01
Melbourne
AU
10 04
Torino
IT
10 07
?
?
10 10
11.3 Promotional activities
The following documents were approved
10412
10357
10316
The MPEG Vision
Vision on 3D Video Coding
Lausanne press release
12 Resolutions of this meeting
13 A.O.B
14 Closing
17
dd-dd
02-06
20-24
29-03
26-30
18-22
12-16
19-23
11-15
Annex A – Attendance list
LASTNAME
AHN
ASAI
BARONCINI
BÄSE
BOBER
BOEHM
BOURGE
BRASNETT
BRULS
CABRERA
QUESADA
FirstName
Jeong-Hwan
Kohtaro
Vittorio
Gero
Miroslaw
Johannes
Arnaud
Paul
Fons
Affiliation
SAMSUNG Electronics
Mitsubishi Electric Corporation
Fondazione Ugo Bordoni
Siemens
Mitsubishi Electric
Deutsche Thomson OHG
ST-NXP Wireless
Mitsubishi Electric R&D Centre Europe
Philips
Country
KR
JP
IT
DE
UK
DE
FR
UK
NL
Julián
ES
CARBALLEIRA
CHAISORN
CHAN
CHEN
CHENG
CHEON
CHIARIGLIONE
CHIARIGLIONE
CHOI
CHOI
CHOI
CHONO
CHUJOH
CIEPLINSKI
CONCOLATO
CORDARA
CORVAGLIA
DAI YONG
DAVIES
DELGADO
DENIS
DIVORRA
ESCODA
DÖHLA
DUN
FRANCOIS
FRÖJDH
GAUVIN
GEIGER
GELISSEN
GERKE
GIOIA
GOURNAY
GRANT
GRANT
Pablo
Lekha
Ti Eu
Ying
Ka Man Carmen
Lee
Filippo
Leonardo
Bumsuk
Kiho
Miran
Keiichi
Takeshi
Leszek
Cyril
Giovanni
Marzia
Kim
Thomas
Jaime
Leon
Universidad Politécnica de Madrid
Grupo de Tratamiento de Imagenes Universidad Politecnica de Madrid
Institute for Infocomm Research
Institute For Infocomm Research (A*STAR)
Tampere University of Technology
MPEG-CHINA
Mr.
CEDEO.net
CEDEO.net
ETRI
Hanyang University
ETRI
NEC Corporation
Toshiba Corporation
Mitsubishi Electric R&D Centre Europe
Telecom ParisTech
Telecom Italia Lab
CNIT - Univ. Brescia
Hanyang University
BBC
Universitat Politècnica de Catalunya
Vrije Universiteit Brussel - ETRO dept.
Oscar
Stephan
Yujie
Edouard
Per
Marc
Ralf
Jean H.A.
Sebastian
Patrick
Philippe
John
Kate
Telefonica Research
Germany
Xian Jiaotong University
thomson
Ericsson
sDae
Germany
Philips Research Laboratories
Fraunhofer HHI
Orange Labs
VoiceAge Corporation
Nine Tiles
Nine Tiles
ES
DE
CN
FR
SE
ES
DE
NL
DE
FR
CA
UK
UK
18
ES
SG
SG
FI
HK
KR
IT
IT
KR
KR
KR
JP
JP
UK
FR
IT
IT
KR
UK
ES
BE
GRILL
GRÜNEBERG
GUEZ VUCHER
GUN
HAECHUL
HANNUKSELA
HARADA
HELLMUTH
HENNEY
HERRE
HONG
HUANG
HUANG
HUI YONG
HUSAK
HWA SEON
HWANG
ISHTIAQ
ITARU
ITO
ITO
IWAMOTO
JANG
JANG
JEON
JEONG
JIN
JUNG
KALVA
KANG
KAZUI
KEILER
KIKUIRI
KIM
KIM
KIM
KIM
KIM
KIM
KIM
KIM
KIM
KIM
KIMATA
KITAMURA
KJOERLING
KLOMP
KOGURE
KUDUMAKIS
LAGADEC
LE FEUVRE
LEE
Bernhard
Karsten
Marc
Bang
Choi
Miska
Noboru
Oliver
Oh
Juergen
Jin Woo
Pengjun
Tiejun
Kim
Walt
Shin
Seo-Young
Faisal
Kaneko
Satoshi
Takashi
Kota
Euee S.
Inseon
Byeungwoo
Dong-Seok
Jukyong
Yang-Won
Hari
Kyeongok
Kimihiko
Florian
Kei
Dongwon
Hae Kwang
Hyungyu
Jin-Seo
JungHoe
Kwangki
Sang-Kyun Kim
Seonghoon
Yeongmi
Yong-Goo
Hideaki
Masatsugu
SE
Sven
Takuyo
Panos
Owen
Jean
Chung Hee
Germany
Fraunhofer HHI
IFPI
ETRI
ETRI
Nokia Corporation
NTT
Germany
LG Electronics
Fraunhofer IIS
ETRI
Qualcomm Inc.
Peking University
ETRI
US - SMPTE
KETI
Samsung Electronics Co. Ltd
Motorola Inc.
TOKYO POLYTECHNIC UNIVERSITY
TOSHIBA CORPORATION
Fujitsu Laboratories Ltd.
NEC Corporation
Hanyang University
ETRI
Sungkyunkwan University
Inha University
inha university
LG Electronics
Florida Atlantic University
ETRI
Fujitsu Laboratories Ltd.
Thomson
NTT DOCOMO, INC.
Sejong Univ.
Sejong University
Hanyang University
ETRI
Samsung Electronics Co. Ltd
Information and Communications University
Myongji University
VAROVISION
Gwangju Institute of Science and Technology
Yonsei Univ.
NTT Corporation
For more convenient AV life
Dolby Sweden AB
Institut für Informationsverarbeitung
Panasonic
Queen Mary University of London
AFNOR
Telecom ParisTech
ETRI
19
DE
DE
FR
KR
KR
FI
JP
DE
KR
DE
KR
US
CN
KR
US
KR
KR
US
JP
JP
JP
JP
KR
KR
KR
KR
KR
KR
US
KR
JP
DE
JP
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
JP
JP
Sweden
DE
JP
UK
FR
FR
KR
LEE
LEE
LEE
LEE
LEE
LEE
LEE
LEFEBVRE
LEVANTOVSKY
LI
LIEBCHEN
LIM
LIM
LIM
LIM
LOPEZ
LUTHRA
MASASHI
MATSUO
MATTAVELLI
MCCANN
MOON
MORÁN BURGOS
MORIYA
MOSCHETTI
MOTTA
MÜLLER
MULTRUS
MURAKAMI
NA
NAKACHI
NAKAYAMA
NARASIMHAN
NARROSCHKE
NEUENDORF
NISHI
NOMURA
NORIMATSU
OAMI
OGURA
OH
OH
OHM
OOMEN
OSTERMANN
PARK
PASCHALAKIS
PATEUX
Gwo Giun
Hyunkook
Jaejoon
Kangchan
Seung Wook
Taejin
Wonsuk
Roch
Vladimir
Zhengguo
Tilman
ChongSoon
Jung Eun
Sung-Chang
Youngkwon
Patrick
Ajay
Takahashi
Shohei
Marco
Ken
Joohee
Francisco
Takehiro
Fulvio
Giovanni
Karsten
Markus
Tokumichi
Sang-Il
Takayuki
Yasushige
Sam
Matthias
Max
Takahiro
Toshiyuki
Takeshi
Ryoma
Yukiko
Eunmi
Weongeun
Jens-Rainer
Werner
Joern
Kyungmo
Stavros
Stéphane
PENG
PHILIPPE
PREDA
PRETEUX
Zhang
Pierrick
Marius
Francoise
National Cheng Kung University
LG electronics
Samsung Electronics
ETRI
ETRI
ETRI
ETRI
Université de Sherbrooke
Monotype Imaging Inc.
Dr.
LG Electronics
Singapore National Body
LG Electronics
ETRI
net&tv Inc.
THOMSON
Motorola
Hitachi, Ltd.
NTT Corporation
EPFL
ZetaCast, representing Samsung
Sejong Univ.
Universidad Politécnica de Madrid
NTT
EPO
Qualcomm Inc.
Fraunhofer HHI
Germany
Mitsubishi Electric Corporation
ETRI
NTT
NHK
US National Body
Panasonic
Germany
Panasonic
NEC
Panasonic
NEC Corporation
IPSJ/ITSCJ
Samsung Electronics
ETRI
RWTH Aachen University
Philips Applied Technologies
Leibniz Universität Hannover
Samsung Electonics Co., Ltd.
Mitsubishi Electric R&D Centre Europe
Orange Labs
Core Network Research Department, Huawei
Technologies Co., Ltd
Orange Labs
Institut TELECOM
Institut TELECOM
20
CN
KR
KR
KR
KR
KR
KR
CA
US
SG
DE
SG
KR
KR
KR
FR
US
JP
JP
CH
UK
KR
ES
JP
DE
US
DE
DE
JP
KR
JP
JP
US
DE
DE
JP
JP
JP
JP
JP
KR
KR
DE
NL
DE
KR
UK
FR
CN
FR
FR
FR
PRIMAUX
PURNHAGEN
QUACKENBUSH
RAAD
RADULOVIC
RAULET
RIDGE
RODRIGUEZ
RODRIGUEZDONCEL
ROSSIER
RYU
SABIRIN
SAKAZUME
SAMPAIO LOBO
SANGHYUN
SATTI
SCHNEIDER
SCHREINER
SEGALL
SEKIGUCHI
SEO
SHEEN WOOK
SHIMIZU
SHIMOR
SHU
SINGER
SMYTH
SONG
SPERSCHNEIDE
R
STANKIEWICZ
SUGIMOTO
SUH
SUN
SUNG
SUZUKI
SUZUKI
SUZUKI
TADDEI
TAKANORI
TAN
TANIMOTO
TANIZAWA
TERENTIEV
TESCHER
THOMA
TIAN
TIMMERER
TOKUMO
TRIMECHE
TUNG
Laurent
Heiko
Schuyler
Mohamad
Ivana
Mickael
Justin
Eva
AFNOR
Dolby Sweden AB
Audio Research Labs
RaadTech Consulting
Ericsson
IETR/INSA of Rennes
Nokia
UPC
FR
SE
US
AU
SE
FR
US
ES
Victor
Jean
Jeha
Muhammad Syah
Houari
Satoru
Lincoln
Joo
Shahid
Andreas
Stephan
Andrew
Shun-ichi
Jeongil
Lee
Shinya
Avraham
Haiyan
David
Neil
Jaeyeon
Universitat Politècnica de Catalunya (UPC)
Nagravision
Gwangju Institute of Science and Technology
ES
CH
KR
Information and Communications University
Victor Company of Japan, Limited
Philips
ETRI
Vrije Universiteit Brussel - ETRO dept.
Dolby Germany GmbH
Germany
Sharp
Mitsubishi Electric Corporation
ETRI
Hanyang Univeristy
NTT
SanDisk Corporation
Institute for Infocomm Research
USA
IST/37
Samsung
KR
JP
NL
KR
BE
DE
DE
US
JP
KR
KR
JP
Israel
SG
US
UK
KR
Ralph
Olgierd
Kazuo
Jung Suk
Huifang
Jaewon
Kazuyoshi
Teruhiko
Yoshinori
Hervé
Senoh
Thiow Keng
Masayuki
Akiyuki
Leonid
Andrew
Herbert
Dong
Christian
Yasuaki
Mejdi
Yi Shin
Fraunhofer IIS
Mr
Mitsubishi Electric Corporation
Samsung Electronics Co.,Ltd
Mitsubishi Electric Research Labs
LG Electornics
Nagoya University
Sony Corp.
NTT DOCOMO, INC
Huawei Technologies
National Institute of Info & Comm tech
NTT DoCoMo, Inc.
Nagoya University
TOSHIBA Corporation
Fraunhofer Institut
Microsoft
germany
Thomson Inc
Klagenfurt University
Sharp Corporation
Nokia Research Center
MStar Semiconductor
DE
PL
JP
KR
US
KR
JP
JP
JP
DE
JP
JP
JP
JP
DE
US
DE
US
AT
JP
FI
CN
21
UGUR
UM
VAN DER
AUWERA
VERMEIRSCH
VETRO
WANG
WITTMANN
WOO-JIN
XIE
XIONG
YAMAKAGE
YAMAMOTO
YASHIMA
YE
YOO
YOSHINO
YU
YU
Kemal
Gi-Mun
Nokia
ETRI
FI
KR
Geert
Kenneth
Anthony
Xin
Steffen
Han
Minjie
Lianhuan
Tomoo
Tomoyuki
Yoshiyuki
Yan
Jeong-Ju
Tomonobu
Haoping
Lu
US
BE
US
US
DE
KR
US
CN
JP
JP
JP
US
KR
JP
US
CN
YUN
ZHAO
ZHONG
ZHOU
ZHU
Kugjin
Yin
Hai Shan
Huan
Yongwei
Samsung Information Systems
Ghent University/MMLab
Mitsubishi Electric
ContentGuard, Inc.
Panasonic
Samsung Electronics
Huawei Technologies (USA)
Huawei Technologies Co., Ltd.
TOSHIBA Corporation
Sharp
NTT
Qualcomm Inc
ETRI
KDDI
Huawei Technologies (USA)
Zhejiang University
ETRI(Electronics and Telecommunications
Research Institute)
Zhejiang University
Panasonic Singapore Laboratories Pte Ltd
Panasonic Singapore Laboratories Pte Ltd
Institute for Infocomm Research
22
KR
CN
SG
SG
SG
Annex B – Agenda
Item
1
2
3
4
5
6
7
8
1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
2
1
2
4
3
1
2
3
4
4
5
6
7
1
2
3
4
8
Opening
Roll call of participants
Approval of agenda
Allocation of contributions
Communications from Convenor
Report of previous meeting
Processing of NB Position Papers
Work plan management
Media coding
HD-AAC Profile
New Profile for ALS
960 frame length in MPEG-4 AAC
AFX 3rd edition
Multiresolution profile
Scalable-complexity 3D mesh compression
Open Font Format extensions
Media Value Chain Ontology
Codec Configuration Representation
Video Tool Library
Spatial Audio Object Coding
Unified Speech and Audio Coding
Representation of Sensory Experience
3D Video Coding
High-Performance Video Coding
New directions in future audio coding
Composition coding
Interactive Digital Radio
LASeR Adaptation
Presentation of Structured Information
Description coding
Video Signature Tools
Metadata driven post processing of audio signals
Audio description coding standards
Extraction and Matching of Image Signature Tools
Systems support
IPMP
Digital Item
Transport and File formats
Carriage of SVC in MPEG-2 Systems
Carriage of MVC in MPEG-2 Systems
Miscellaneous additions to File Format
AVC File Format extensions for MVC
Multimedia architecture
23
1
2
3
4
5
9
1
10
1
11
1
2
3
4
5
6
7
8
9
10
11
12
12
1
2
3
4
5
6
7
8
9
10
11
12
13
14
13
1
2
3
4
5
6
7
8
14
9
Interfaces with virtual worlds
MXM Architecture and API
MXM API
Advanced IPTV Terminal
Rich Media UI Framework
Application formats
Interactive Music Application Format
Protocols
MXM Protocols
Reference implementation
AAC-ELD Reference Software
MVC Reference Software
File Format Reference Software
Geometry and Shadow Reference Software
3D Graphics Compression Model Reference Software
Scene Partitioning Reference Software
Image Signature Tools Reference Software
Protected Musical Slide Show MAF Reference Software
Musical Slide Show MAF Reference Software
Professional Archival MAF Reference Software
Video Surveillance MAF Reference Software
MXM Reference Software
Conformance
MVC Conformance
MPEG-4 Audio Conformance
AAC-ELD, OAFI and additional AAC Conformance
File Format Conformance
Scene Partitioning Conformance
MultiResolution Profile Conformance
3D Graphics Compression Model Conformance
Image Signature Tools Conformance
Photo Player MAF Conformance
Musical Slide Show MAF Conformance
Professional Archival MAF Conformance
Video Surveillance MAF Conformance
Video Tool Library Conformance
MXM Conformance
Maintenance
Systems coding standards
Video coding standards
Audio coding standards
3DG coding standards
Visual description coding standards
Audio description coding standards
MPEG-21 standards
MPEG-A standards
Work plan and time line
Organisation of this meeting
24
1
2
10
1
2
3
4
5
6
7
1
2
3
4
8
9
11
1
2
3
12
13
14
Tasks for subgroups
Joint meetings
WG management
Terms of reference
Officers
Editors
Liaisons
Work item assignment
Ad hoc groups
Asset management
Reference software
Conformance
Test material
URI
IPR management
Work plan
Administrative matters
Responses to National Bodies
Schedule of future MPEG meetings
Promotional activities
Resolutions of this meeting
A.O.B
Closing
25
Annex C – Input contributions
Number
Source
m15944 Webmaster
m15945
Francisco Morán Burgos,
Patrick Gioia
m15946 Yi-Shin Tung, Teruhiko Suzuki
Title
Lausanne document register
Ad Hoc Group on 3DGC documents, software
maintenance and core experiments
Ad Hoc Group on Maintenance of MPEG-4 Visual
related Documents, Reference Software and
Conformance
m15947
Euee S. Jang, Marco Mattavelli,
Ad Hoc Group on Reconfigurable Video Coding
Kazuo Sugimoto
m15948
Miroslaw Bober, Paul Brasnett,
Ryoma Oami
Ad Hoc Group on MPEG-7 Visual
m15949
Hideaki Kimata, Aljoscha
Smoli, Anthony Vetro
Ad Hoc Group on 3D Video and FTV Coding
Jens-Rainer Ohm, Jörn
m15950 Ostermann, Ajay Luthra, Jason
Suh, T.K. Tan
m15951
Filippo Chiariglione, Marius
Preda
Ad Hoc Group on High-Performance Video Coding
Ad Hoc Group on MxM
m15952 R. Sperschneider
Ad Hoc Group on Audio Standards Maintenance
m15953 S. Quackenbush, P. Philippe
Ad Hoc Group on SAOC, USAC
m15954
Jean Gelissen, Marius Preda,
Keiji Mitsubuchi
Ad Hoc Group on Information Exchange with Virtual
Worlds
m15955
Young-Kwon Lim, Jaeyeon
Song, Cyril Concolato
Ad Hoc Group on Scene Representation
m15956 David Singer
Ad Hoc Group on MPEG File Formats
Kyuheon Kim, Hui Yong Kim,
m15957 Jean Cha, Noboru Harada,
Hendry
Ad Hoc Group on Application Format
m15958
Sanghyun Joo, Jean Gelissen,
Christian Timmerer
Ad Hoc Group on the RoSE Framework
m15959 Xin Wang, Young Kwon Lim
Ad Hoc Group on Advanced IPTV Terminal
m15960 Vladimir Levantovsky
Ad Hoc Group on Font Format Representation
m15961 SC 29 Secretariat
Summary of Voting on ISO/IEC 138184:2004/Amd.2:2005/DCOR 2 [SC 29 N 9874]
m15962 SC 29 Secretariat
Summary of Voting on ISO/IEC 13818-7:2006/DCOR
1 [SC 29 N 9875]
26
m15963 SC 29 Secretariat
Summary of Voting on ISO/IEC 144963:2005/Amd.2:2006/DCOR 4 [SC 29 N 9876]
m15964 SC 29 Secretariat
Summary of Voting on ISO/IEC 144963:2005/Amd.3:2006/DCOR 2 [SC 29 N 9877]
m15965 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-4:2004/DCOR
6 [SC 29 N 9878]
m15966 SC 29 Secretariat
Summary of Voting on ISO/IEC 144964:2004/Amd.13:2007/DCOR 2 [SC 29 N 9879]
m15967 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-15:2004/DCOR
3 [SC 29 N 9880]
m15968 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-20:200X 2nd
Edition/PDAM 2 [SC29 N 9881]
m15969 SC 29 Secretariat
Summary of Voting on ISO/IEC 23000-9:2008/PDAM
1 [SC 29 N 9882]
m15970 ITTF via SC 29 Secretariat
Table of Replies on ISO/IEC FDIS 23000-8 [SC 29 N
9910]
m15971 ITTF via SC 29 Secretariat
Table of Replies on ISO/IEC FDIS 15938-12 [SC 29 N
9911]
m15972 ITTF via SC 29 Secretariat
Table of Replies on ISO/IEC 21000-7:2007/FDAM 1
[SC 29 N 9912]
m15973 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-3:2005/DCOR
6 [SC 29 N 9913]
m15974 SC 29 Secretariat
Summary of Voting on ISO/IEC 144963:2005/Amd.9:2008/DCOR 1 [SC 29 N 9914]
m15975
IEC TC 100 via SC 29
Secretariat
IEC CDV 62546 [SC 29 N 9917]
m15976 ITTF via SC 29 Secretariat
Table of Replies on ISO/IEC 14496-5:2001/FDAM 18
[SC 29 N 9928]
m15977 SC 29 Secretariat
Summary of Voting on ISO/IEC 2300010:200X/PDAM 1 [SC 29 N 9929]
m15978
ITU-R SG 6 via SC 29
Secretariat
Liaison Statement from ITU-R SG 6 [SC 29 N 9930]
m15979
ITU-R SG 6 via SC 29
Secretariat
Liaison Statement from ITU-R SG 6 [SC 29 N 9931]
m15980 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-4:2004/FPDAM
34
m15981 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-5:2001/PDAM
15
m15982 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-4:2004/FPDAM
33 [SC 29 N 9948]
27
m15983 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-5:2001/FPDAM
22 [SC 29 N 9949]
m15984 SC 29 Secretariat
Summary of Voting on ISO/IEC FCD 14496-22 [2nd
Edition]
m15985 ITTF via SC 29 Secretariat
Table of Replies on ISO/IEC 13818-1:2007/FDAM 3
m15986 ITTF via SC 29 Secretariat
Table of Replies on ISO/IEC 14496-4:2004/FDAM 30
m15987 ITTF via SC 29 Secretariat
Table of Replies on ISO/IEC 14496-5:2001/FDAM 20
m15988 SC 29 Secretariat
Summary of Voting on ISO/IEC 13818-1:2007/PDAM
4
m15989 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-3:2005/PDAM
10
m15990 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-4:2004/PDAM
37
m15991 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-4:2004/PDAM
39
m15992 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-5:2001/PDAM
25
m15993 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-15:2004/PDAM
3
m15994 SC 29 Secretariat
Summary of Voting on ISO/IEC 15938-12:2008/DCOR
1
m15995 SC 29 Secretariat
Summary of Voting on ISO/IEC CD 21000-19
m15996 SC 29 Secretariat
Summary of Voting on ISO/IEC 23000-6:200X/PDAM
1
m15997 ITTF via SC 29 Secretariat
Table of Replies on ISO/IEC 14496-16:2006/FDAM 2
m15998 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-4:2004/PDAM
38
m15999 SC 29 Secretariat
Summary of Voting on ISO/IEC 144965:2001/Amd.10:2007/DCOR 3
m16000 SC 29 Secretariat
Summary of Voting on ISO/IEC 1449620:200X/PDAM 3
m16001 SC 29 Secretariat
Summary of Voting on ISO/IEC 14496-12:200X/DCOR
2 & ISO/IEC 15444-12:200X/DCOR 2
m16002 SC 29 Secretariat
Summary of Voting on ISO/IEC 1449610:200X/PDAM 1
m16003 W3C via SC 29 Secretariat
Liaison Statement from W3C [SC 29 N 9930]
m16004
IEC TC 100 via SC 29
Secretariat
m16005 TTA via SC 29 Secretariat
IEC CDV 61937-11 [SC 29 N 9951]
Liaison Statement from TTA [SC 29 N 9959]
28
m16006 SC 24 via SC 29 Secretariat
m16007
IEC TC 100 via SC 29
Secretariat
ISO/IEC FCD 19775-2.2 2nd Edition [SC 29 N 9958]
IEC CD 62455 [SC 29 N 9960]
m16008 Leonardo Chiariglione
Use cases for consideration by Ad Hoc Group on
Advanced IPTV Terminal
m16009 Leonardo Chiariglione
Technologies for consideration by Ad Hoc Group on
Advanced IPTV Terminal
m16010 Walter Allasia
Peer-to-Peer iDRM
m16011
Filippo Chiariglione
,Tiejun Huang
Web, Internet and Mobile TV
m16012 Leonardo Chiariglione
The Digital Media in Italia proposal
m16013 Lucia Marchisio
Open IPTV Platform For an Open Content Market
m16014 Young-Kwon LIM
Approaching the Zettabyte Era
m16015 Young-Kwon LIM
Contribution to the scope of the planned Advanced
IPTV Terminal standard
m16016 SC 29 Secretariat
Establishment of JTC 1/Study Group on Digital Content
Management and Protection
m16017 Leonardo Chiariglione
The MPEG Vision
Shun-ichi Sekiguchi
Yoshihisa Yamada
m16018 Yoshiaki Kato
Kohtaro Asai
Tokumichi Murakami
Response to call for test materials for HVC study
Shun-ichi Sekiguchi
Shuichi Yamagishi
Yoshihisa Yamada
m16019
Yoshiaki Kato
Kohtaro Asai
Tokumichi Murakami
On coding efficiency with extended block size for
UHDTV
m16020 IFPI via SC 29 Secretariat
Liaison Statement from IFPI [SC 29 N 9995]
m16021 Gangyi Jiang
Depth Map Compression for View Synthesis in FTV
m16022 Japan National Body
JNB comment on the resolution 3.5.4
m16023
Simon Daniels
Vladimir Levantovsky
m16024 Christian Timmerer
m16025
Sergio Arnaldo
Francisco Morán Burgos
Olgierd Stankiewicz
m16026 Krzysztof Wegner
Krzysztof Klimaszewski
Proposal for a new work item for ISO/IEC 14496-22
MPEG Representation of Sensory Effects Vision
Corrections to "WD3.0 of ISO/IEC 14496-16 AMD4,
Scalable Complexity 3D Mesh Coding"
Results of 3DV/FTV Exploration Experiments,
described in w10173, for Alt Moabit sequence.
29
m16027
Krzysztof Wegner
Olgierd Stankiewicz
Analysis of sub-pixel precision in Depth Estimation
Reference Software and View Synthesis Reference
Software
m16028
Olgierd Stankiewicz
Krzysztof Wegner
Application of Middle Level Hypothesis algorithm for
improvement of depth maps produced by Depth
Estimation Reference Software.
m16029
IEC TC 100 via SC 29
Secretariat
Liaison Statement from IEC TC 100 [SC 29 N 10025]
m16030
Yi-Shin Tung
Hwa Seon Shin
Editor's Input on Study Text of ISO/IEC FCD 23002-4
Hwa Seon Shin
Sowon Kim
Minsoo Park
m16031 Hyungyu Kim
Sinwook Lee
Byeongho Choi
Euee S. Jang
Revised FU Network and Tokens for MPEG-4 SP
m16032 Singapore National Body
SGNB Comments on Multiview Video Coding Profile
Gwo Giun Lee
Jia-wei Liang
m16033
He-Yuan Lin
Ming-Jiun Wang
Functional unit of AVC deblocking filter with MBAFF
Fons Bruls
Lincoln Lobo
m16034
Yin Zhao
Lu Yu
Basic LDV view-synthesis/renderer SW : LDVS
m16035
TK Tan
Yoshinori Suzuki
Noboru Harada
Tilman Liebchen
m16036
Takehiro Moriya
Yutaka Kamamoto
m16037
Kangchan Lee
Seungyun Lee
Kyungmo Park
Cyril Concolato
m16038
Jean Le Feuvre
Giovanni Cordara
m16039
Wonsuk Lee
Seungyun Lee
Yin Zhao
m16040 Deliang Fu
Lu Yu
Response to Call for Test Materials for HighPerformance Video Coding Standards Development
Proposed Text of ISO/IEC 144963:2005/Amd.2:2006/DCOR4
Proposal of MXM Ontology for inter-MXM
communication protocols
Items under considerations in Rich UI Framework
Proposal for new APIs of video metadata on MXM
APIs
LDV Virtual View Rendering Software
30
Fons Bruls
Lincoln Lobo
Yin Zhao
Deliang Fu
m16041
Lu Yu
Lianhuan Xiong
Temporal Improvement Method in View Synthesis
Yin Zhao
m16042 Deliang Fu
Lu Yu
3DV EE3 Report on Champagne_tower Sequences
m16043
Yin Zhao
Lu Yu
3DV EE4 Report on Dog Sequences
m16044 Mohamad Raad
Comment on the unified speech and audio coding
activity
m16045 Mohamad Raad
comment on the exploration on metadata driven postprocessing of audio
Carmen CHENG
m16046 Yan HUO
Yu LIU
3DV EE3 results on Dog sequence
Carmen CHENG
m16047 Yan HUO
Yu LIU
3DV EE4 results on Dog sequence
Hui Yuan
Yilin Chang
Haitao Yang
m16048
Xiaoxian Liu
Sixin Lin
Lianhuan Xiong
Depth Estimation Improvement for Depth Discontinuity
Areas and Temporal Consistency Preserving
Xiaoxian Liu
Yingying Guo
Haitao Yang
m16049 Junyan Huo
Yilin Chang
Sixin Lin
Lianhuan Xiong
3DV/FTV EE3/EE4 Results on Alt Moabit sequence
Siping Tao
Ying Chen
m16050
Miska M. Hannuksela
Houqiang Li
Depth Map Coding Quality Analysis for View Synthesis
Seungju Han
Hyunjeong Lee
m16051
Jae-Joon Han
jeong-hwan ahn
Full motion control and navigation of avatar/object with
multi-input sources in MPEG-V
m16052
Woo-Jin Han
JeongHoon Park
Samsung's response to Call for Test Materials for
MPEG HVC standardization
31
IlKoo Kim
Tammy Lee
Ken McCann
m16053
Ivana Radulovic
Per Fröjdh
3DTV Exploration Experiments on Pantomime
sequence
m16054 Stefan Doehla
Scalable Audio and MP4
Kihyun Choo
m16055 Junghoe Kim
Eunmi Oh
Comments on WD of Unified Speech and Audio
Coding
m16056
Markus Schnell
Per Ekstrand
Proposed Draft Corrigendum on AAC-ELD
Novel approaches to remote display representations:
BiFS-based solution and its deployment within the FP7
MobiThin project
m16057
Françoise PRETEUX
Mihai MITREA
Pieter SIMOENS
Bojan JOVESKI
m16058
Bert VANKEIRSBILCK
Abdeslam TAGUENGAYTE
Françoise PRETEUX
Novel approaches to remote display representations:
BiFS-based solution and its deployment within the FP7
MobiThin project
Sehoon Yea
m16059 Zafer Arican
Anthony Vetro
Results of Exploration Experiments in 3D Video for
Lovebird2
m16060
Cheon Lee
Yo-Sung Ho
EE1: Depth Estimation Results on 'Pantomime?
Sequence
m16061
Cheon Lee
Yo-Sung Ho
EE2: View Synthesis Results on 'Pantomime? Sequence
m16062
Cheon Lee
Yo-Sung Ho
EE4: Coding Results on 'Pantomime? Sequence
Sang-Beom Lee
m16063 Cheon Lee
Yo-Sung Ho
m16064
Cheon Lee
Yo-Sung Ho
Cheon Lee
Jae-Il Jung
m16065
Yun-Suk Kang
Yo-Sung Ho
m16066
Takanori Senoh
Kenji Yamamoto
Experimental Results on Improved Temporal
Consistency Enhancement
Implementation of Boundary Noise Removal for View
Synthesis
Additional Test Sequence for 3D Video
Report of 3DV/FTV Exploration E xperiments with
Champagne Tower
32
Ryutaro Oi
Tomoyuki Mishina
Makoto Okui
Gun Bang
Gi Mun Um
m16067
Namho Hur
Jinwoong Kim
3DV/FTV EE results of Depth Estimantion and View
Synthesis on "lovebird1" sequence
Gun Bang
Gwang sin Cho
m16068 Namho Hur
Jinwoong Kim
Donggyu Sim
3DV/FTV EE4 result of Coding Experiment on "Dog"
sequence
m16069
Steffen Kamp
Mathias Wien
Fast Decoder Side Motion Vector Derivation with
Candidate Scaling for Improving AVC Compression
Performance
Gun Bang
Jaeho Lee
m16070
Namho Hur
Jinwoong Kim
The consideration of the imrpoved depth estimation
algorithm
m16071 Andy Tescher for USNB
Response to resolution 3.5.4 of 86-th WG 11 meeting
m16072 Jean H.A. Gelissen (ed)
MPEG-V CfP Response
Seo-Young Hwang
Jaeyeon Song
m16073
Young-Kwon Lim
Jean Le Feuvre
Comment on Study text of 14496-20 PDAM2
Seo-Young Hwang
m16074 Jaeyeon Song
Young-Kwon Lim
Improvement of parsingSwitch on 14496-20 PDAM2
Seo-Young Hwang
m16075 Jaeyeon Song
Young-Kwon Lim
Service Scenario examples on 14496-20 PDAM2
Hyungyu Kim
Sinwook Lee
Hwa Seon Shin
m16076
Sowon Kim
Minsoo Park
Euee S. Jang
Comments on ISO/IEC 23001-4 FCD 2
Hyungyu Kim
m16077 Sinwook Lee
Euee S. Jang
Update proposal on the Vision of RVC
Hui Yong Kim
Myung Seok Ki
m16078
HanKyu Lee
Houari Sabirin
Updated text, conf. files, and ref. sw for ISO/IEC
23000-9 (DMB-AF)
33
Munchurl Kim
Jung Soo Lee
Yong Han Kim
Herve Taddei
m16079 Minjie Xie
Qing Zhang
Discussion on the Unified Speech and Audio Coding
Activity
m16080 Jean H.A. Gelissen (ed)
MPEG-V CfP Response
Inseon Jang
m16081 Huiyong Kim
Jeongil Seo
Study text of ISO/IEC 23000-12 WD Interactive music
application format
Tomonobu Yoshino
m16082 Sei Naito
Shigeyuki Sakazawa
Preliminary response for Draft Call for Evidence on
High Performance Video Coding
m16083 WG 1 via SC 29 Secretariat
Liaison Statement from SC 29/WG 1
Kwang-Ki Kim
Jeongil Seo
m16084 Seungkwon Beack
Kyeongok Kang
MinsooHahn
CE on Residual Coding Process for Post Downmix Gain
m16085
jean Le Feuvre
Cyril Concolato
Zhong Haishan
Zhou Huan
m16086 Chong Kok Seng
Tomokazu Ishikawa
Takeshi Norimatsu
Comments on LASeR PDAM2
Efficient inter-object relation indicator for SAOC
m16087
Patrick Lopez
Dong Tian
3DV/FTV EE3 : LeavingLaptop and Lovebird1
m16088
Patrick Lopez
Dong Tian
3DV/FTV EE4 : Dog sequence
m16089 Teruhiko Suzuki
Comments on 14496-12:200X FPDAM1
Masayuki Tanimoto
m16090 Toshiaki Fujii
Kazuyoshi Suzuki
View Synthesis Algorithm in View Synthesis Reference
Software 2.0 (VSRS2.0)
Masayuki Tanimoto
m16091 Toshiaki Fujii
Kazuyoshi Suzuki
View Synthesis Method without Blending
Masayuki Tanimoto
m16092 Toshiaki Fujii
Kazuyoshi Suzuki
Depth Estimation Reference Software (DERS) with
Image Segmentation and Block Matching
m16093
Masayuki Tanimoto
Toshiaki Fujii
Data Format for FTV
34
Kazuyoshi Suzuki
m16094
Mejdi Trimeche
Miska M Hannuksela
Results of 3D Video Coding Experiments EE1 and EE2
for Dog Data Set
Jonas Engdegård
Heiko Purnhagen
Oliver Hellmuth
Johannes Hilpert
Maria Luis Valero
m16095
Andreas Hölzer
Markus Schnell
Leonid Terentiev
Erik Schuijers
Per Ekstrand
Information regarding CE on Low Delay MPEG SAOC
Jonas Engdegård
Heiko Purnhagen
m16096 Oliver Hellmuth
Leonid Terentiev
Erik Schuijers
Information regarding CE on Low Power MPEG SAOC
Cornelia Falch
Leonid Terentiev
m16097
Johannes Hilpert
Oliver Hellmuth
Information regarding mixing mode for the enhanced
Karaoke/Solo processing
Leonid Terentiev
m16098 Cornelia Falch
Oliver Hellmuth
Proposal for MCU functionality extension for the
MPEG SAOC
Jonas Engdegård
Heiko Purnhagen
Cornelia Falch
Leonid Terentiev
Andreas Hölzer
m16099
Oliver Hellmuth
Johannes Hilpert
Yang-Won Jung
Henney Oh
Jeroen Koppens
Report on corrections for the MPEG SAOC FCD text
and RM software
Heiko Purnhagen
Cornelia Falch
Leonid Terentiev
Oliver Hellmuth
m16100
Johannes Hilpert
Yang-Won Jung
Henney Oh
Jeroen Koppens
Proposal for dynamic preset extension for the MPEG
SAOC
m16101
Shinya Shimizu
Hideaki Kimata
m16102 Zhuangfei Wu
3DV/FTV EE Report on Doorflower sequence
Updates to the MVC File Format
35
Per Fröjdh
m16103
Yang-Won Jung
Henney Oh
Consideration on User Interface in SAOC
m16104
Yang-Won Jung
Henney Oh
Proposal for adding information on object
characteristics in SAOC
m16105
Yang-Won Jung
Henney Oh
Proposal for including guideline information on the
rendering parameters in SAOC
m16106
Yang-Won Jung
Henney Oh
Comments on the enhanced karaoke mode in SAOC
Pierrick Philippe
m16107 Gregory Pallone
Marc Emerit
Proposed Audio Sequences for MPEG-D SAOC
m16108 Matthias Gruhne
Study on ISO/IEC TR 15938-8:2002/FPDAM 4
Shangwen Li
m16109 Lu Yu
Lianhuan Xiong
Second Order Prediction of Video Coding
m16110 Werner Oomen
Comment on the unified speech and audio coding
activity
m16111 Tomoo Yamakage
Potential Corrigendum Items for MPEG-2 Systems
m16112
Sangil Na
DongSeok Jeong
Proposal to remoe pair in comparison pair list for
indpendence test
m16113
WonGeun Oh
JuKyong Jin
Ground true table & incorrect video query clips of
AVC, CC
m16114
Miska M. Hannuksela
Ying Chen
On MVC File Format
m16115 Miska M. Hannuksela
On FPDAM1 of ISO Base Media File Format
m16116 S. Quackenbush
86th MPEG Audio Report
Frans de Bont
Stefan Döhla
m16117
Heiko Purnhagen
Alexander Gröschel
Thoughts on MPEG Surround signaling
Hyun-Kook Lee
Dong Soo Kim
m16118
Sungyong Yoon
Jaehyun Lim
Considerations on the development of common USAC
reference encoder
Dong Soo Kim
Sungyong Yoon
m16119
Hyun-Kook Lee
Jaehyun Lim
Proposed syntax revision on USAC RM0
m16120
Dong Soo Kim
Sungyong Yoon
Proposed syntax revision regarding SBR bitstream on
USAC RM0
36
Hyun-Kook Lee
Jaehyun Lim
Heiko Purnhagen
m16121 Jeroen Koppens
Matthias Neusinger
Further corrections to MPEG Surround text
Dong Soo Kim
Sungyong Yoon
m16122
Hyun-Kook Lee
Jaehyun Lim
Efficient signaling for FD frame on USAC RM0
Dong Soo Kim
Sungyong Yoon
m16123
Hyun-Kook Lee
Jaehyun Lim
Comment on random access issue on USAC RM0
Heiko Purnhagen
Jeroen Koppens
m16124
Claus-Christian Spenger
Matthias Neusinger
Corrections to MPEG Surround reference software
Dong Soo Kim
Sungyong Yoon
m16125
Hyun-Kook Lee
Jaehyun Lim
Proposed syntax revision regarding window sequence
on USAC RM0
Jaime Delgado
Eva Rodríguez
Víctor Rodríguez-Doncel
m16126
Silvia Llorente
Rubén Barrio
Víctor Torres
DMAG-UPC Comments on WD2.0 of MXM API
Ralf Geiger
Fabian Haussel
m16127
Michael Haertl
Virgilio Bacigalupo
Proposed Corrigendum on MPEG-4 SLS Conformance
Victor Rodriguez-Doncel
m16128 Jaime Delgado
Ruben Tous
Presentation of the W3C MAWG Activities
Y. Wang
K. Müller
m16129
P. Merkle
A. Smolic
Results of Exploration Experiments in 3D Video
Coding for Dog Data Set
Aljoscha Smolic
Karsten Mueller
m16130
Peter Kauff
Thomas Wiegand
Considerations about the Vision of a 3D Video Standard
Laurent Primaux
m16131 Owen Lagadec
Emmanuel Bouix
Report of Mini Experiment on IM AF Constraints
representation
37
Fabien Gallot
Inseon Jang
Hui Yong Kim
Jeongil Seo
Kyeongok Kang
Laurent Primaux
Owen Lagadec
m16132
Emmanuel Bouix
Fabien Gallot
Constraints Specifications for IM AF
Next generation Broadcasting
m16133
Forum(Korea)
Proposed Text for WD of ISO/IEC 23000-11
Stereoscopic Video AF Conformance and Reference
Software
Laurent Primaux
Owen Lagadec
Emmanuel Bouix
Fabien Gallot
m16134
Inseon Jang
Hui Yong Kim
Jeongil Seo
Kyeongok Kang
Constraints representation method for IM AF
m16135
Fons Bruls
Lincoln Lobo
delete
Hussein Aman-Allah
m16136 Ihab Amer
Marco Mattavelli
An AVC Entropy Coding Module for the MPEG RVC
VTL
Ehab Asaad Hanna
m16137 Ihab Amer
Marco Mattavelli
An AVC Motion estimation Module for the MPEG
RVC VTL
Karim Maarouf
m16138 Ihab Amer
Marco Mattavelli
An AVC Intra Prediction Module for the MPEG RVC
VTL
m16139
Fons Bruls
Lincoln Lobo
Philips 3DV EE1,2,3,4 results
m16140
Kristofer Kjörling
Heiko Purnhagen
Core Experiment procedures and MPEG reference
software encoder
m16141
Kristofer Kjörling
Andreas Schneider
Proposal for splitting the current AAC family profiles
into two
Lars Villemoes
m16142 Per Ekstrand
Kristofer Kjörling
Core experiment proposal on the USAC eSBR module
m16143 Pierrick Philippe
Proposed improvements for MPEG Audio Core
Experiment Methodology and Reference Software
Development
38
Stephan Schreiner
m16144 Wolfgang Fiesel
Akshaya Thippur
Perspectives on Application Scenarios for PostProcessing Audio Metadata
Matthieu Wipliez
m16145 Mickael Raulet
Jean-François Nezan
Proposed changes for RVC-CAL annex A of ISO-IEC
23001-4
Roch Lefebvre
m16146 Philippe Gournay
Redwan Salami
Comments on Core Experiments methodology for
MPEG USAC standardisation
Philippe Gournay
Bruno Bessette
m16147
Roch Lefebvre
Redwan Salami
Proposed Core Experiment on LPC Quantization for
USAC
Khaled Mamou
Titus Zaharia
m16148
Marius Preda
Françoise Preteux
Attributes Encoding for TFAN
Benoit Le Bonhomme
m16149 marius.preda@int-evry.fr
Françoise Preteux
Scalable Complexity Mesh Coding Benchmark
Benoit Le Bonhomme
m16150 marius.preda@int-evry.fr
Françoise Preteux
MMW.com API extension for 3D graphics attributes
Ivica Arsov
m16151 marius.preda@int-evry.fr
Françoise Preteux
MXM API for 3D Graphics content creation
Blagica Jovanova
m16152 marius.preda@int-evry.fr
Françoise Preteux
Selecting elementary streams in MP25 RefSoft
Max Neuendorf
Philippe Gournay
Jérémie Lecomte
Markus Multrus
m16153
Stefan Bayer
Guillaume Fuchs
Ralf Geiger
Frederik Nagel
Proposed Corrections to WD and Reference Software
on Unified Speech and Audio Coding
Jérémie Lecomte
Max Neuendorf
m16154
Ralf Geiger
Markus Multrus
Proposed Update on USAC Bitstream Syntax
m16155 Christian Timmerer
On WD 1.0 of ISO/IEC 21000-2:2005 AMD1 (PSI)
m16156
Taejin Lee
Max Neuendorf
Progress of Technology Merge Between System 2 and
USAC RM
39
Jeremie Lecomte
Kyeongok Kang
Bernhard Grill
m16157
Markus Waltl
Christian Timmerer
Minor Corrections to RoSE WD 2.0 XML Schema
m16158 Christian Timmerer
Updates for the MPEG Extensible Middleware
Maria Teresa Andrade
Vítor Barbosa
Anna Carreras
Pedro Carvalho
Giovanni Cordara
Jaime Delgado
Safak Dogan
Frederic Dufaux
Touradj Ebrahimi
Gianluca Francini
Isabel Gallego
m16159 Lutz Goldmann
Ivan Ivanov
Thien Ha Minh
Shan Jin
Hemantha Kodikara Arachchi
Gelareh Mohammadi
Marta Mrak
Adam Pietrowcew
Toni Rama
Eva Rodríguez
Thomas Sikora
Rubén Tous
Extended template for the Advanced Surveillance AF
Bernhard Grill
Jürgen Herre
m16160 Ralf Geiger
Max Neuendorf
Markus Multrus
Thoughts on Core-Experiment Methodology
Maria Teresa Andrade
Vítor Barbosa
Anna Carreras
Pedro Carvalho
Giovanni Cordara
Jaime Delgado
Safak Dogan
m16161
Frederic Dufaux
Touradj Ebrahimi
Gianluca Francini
Isabel Gallego
Lutz Goldmann
Ivan Ivanov
Thien Ha Minh
Contribution to the Advanced Surveillance AF
40
Shan Jin
Hemantha Kodikara Arachchi
Gelareh Mohammadi
Marta Mrak
Adam Pietrowcew
Toni Rama
Eva Rodríguez
Thomas Sikora
Rubén Tous
m16162
Guillaume Fuchs
Markus Multrus
Eva Rodríguez
m16163 Jaime Delgado
Isabel Gallego
m16164
Ihab Amer
Marco Mattavelli
Proposed Update of Arithmetic Coder Tables for USAC
Fragments governance for the Advanced Surveillance
AF
An MPEG Fixed Point IDCT Module for the RVC VTL
Fons Bruls
m16165 Lincoln Lobo
Wiebe de Haan
On addressing market 3D developments, Stereo &
MPEG 3DV activity.
m16166 Mauri Väänänen
On the Unified Speech and Audio Coding Activity
m16167
Markus Waltl
Christian Timmerer
Updates and Additional Tools for MPEG RoSE
Christian Timmerer
Mark Stuard
m16168
Franc Kozamernik
Jari Ahola
Use cases and Requirements for Advanced Internet TV
Terminals
Jin-Seo Kim
Maeng-Sub Cho
m16169 Bon-Ki Koo
Yong Soo Joo
Sang-Kyun Kim
A simple RoSE system implementation including SDC,
USP, and SDCom
Jin-Seo Kim
Maeng-Sub Cho
m16170 Bon-Ki Koo
Yong Soo Joo
Sang-Kyun Kim
A demonstration for reference color type and its
parameters in RoSE
m16171
Kota Iwamoto
Ryoma Oami
Response to the Call for Proposals on Video Signature
Tools
Paul Brasnett
m16172 Stavros Paschalakis
Miroslaw Bober
Response to the Call for Proposals on Video Signature
Tools
JungHoe Kim
m16173 Julien Robilliard
Eunmi Oh
Progress report on phase experiment for USAC
41
Bernhard Grill
m16174 Jean H. A. Gelissen (Ed)
MPEG-V CfP Response
Dong Tian
m16175 Po-Lin Lai
Patrick Lopez
3DV EE1 & EE2 on Leaving_Laptop and
Improvements in ViSBD 2.1
Houari Sabirin
Hendry
m16176
Noboru Harada
Munchurl Kim
Status report on ISO/IEC 23000-6 Professional Archival
Application Format Reference Software and
Conformance files
m16177
Hosang Sung
Eunmi Oh
Progress report on unvoiced speech coding
m16178 Jens-Rainer Ohm
Received responses to CfP on Video Signature Tools
Cyril Concolato
m16179 Jean Le Feuvre
Benoit Pellan
Comments on requirements for a new BIFS profile
m16180 David Singer
On the MVC File format (14496-15 amendment)
m16181 David Singer
Errata report for 14496-12:2005 (ISO Base Media File
Format)
m16182 David Singer
On Movie Fragments, Edit Lists, and other timing
questions, for 14496-12 (ISO Base Media File Format)
m16183 Filippo Chiariglione
Proposed WD3.0 of MxM Architecture and
Technologies
m16184 Filippo Chiariglione
Proposed WD3.0 of MxM APIs
m16185 Filippo Chiariglione
Proposed WD2.0 of MxM Ref. SW. and Conf.
m16186 Filippo Chiariglione
Proposed WD2.0 of 2nd edition of ISO/IEC 29116-1
(MXM Protocols)
m16187 Patrick Gioia
MXM use-case proposals for 3D services
m16188 Leonardo Chiariglione
Proposal of Advanced IPTV Terminal (AIT)
requirements
Jaewon Sung
m16189 Yong-Joon Jeon
Byeong-Moon Jeon
3DV EE1 and EE2 Results on Newspaper Sequence
Jaewon Sung
m16190 Yong-Joon Jeon
Byeong-Moon Jeon
3DV EE4 Results on Pantomime Sequence
m16191
Yasuaki Tokumo
Shin-ya Hasegawa
Yasuaki Tokumo
m16192 Shin-ya Hasegawa
Takuya Iwanami
Study on Sensory Effect Metadata
Proposal for Sensory Effect Metadata
42
m16193 IDF via SC 29 Secretariat
m16194
World DMB Forum via SC 29
Secretariat
Liaison Statement from the International DOI
Foundation (IDF)
Liaison Statement from the World DMB Forum
Kyoungsoo Son
Seungwook Lee
m16195 Bonki Koo
Daiyong Kim
Euee S. Jang
An Explanation of SVA and QBCR En-Decoding
Algorithm
Seungwook Lee
Bonki Koo
m16196 Daiyong Kim
Kyoungsoo Son
Euee S. Jang
Bitstream Syntax and Semantics for QBCR and SVA
Seungwook Lee
Bonki Koo
m16197 Daiyong Kim
Kyoungsoo Son
Euee S. Jang
CE Report Version 3 on the SC3DMC
Daiyong Kim
Seungwook Lee
m16198
Kyoungsu Son
Preda Marius
A Report on the Conformance Test of 3D Graphics
Group
Seungwook Lee
Bonki Koo
m16199 Daiyong Kim
Kyoungsoo Son
Euee S. Jang
A Report on the Reference Software of SC3DMC
Miyoung Kim
m16200 Junghoe Kim
Eunmi Oh
Proposed BSAC Conformance Bitstreams for Terrestrial
DMB
B. S. Choi
SangHyun Joo
m16201
HaeRyong Lee
KwangRo Park
Comments and Proposal for Sensory Effect Metadata
Kei Kikuiri
m16202 Nobuhiko Naka
Kousuke Tsujino
Comments on USAC Standardization Activities
m16203 Sanghyun Joo
A proposal for RoSE system architecture
m16204 Aljoscha Smolic
delete
m16205 Aljoscha Smolic
delete
m16206 Aljoscha Smolic
delete
m16207 Kemal Ugur
Requirements for high-performance video standards
43
Justin Ridge
m16208 Mohamad Raad
Proposal for the development of a common MPEG
Audio encoder for use in the CE phase
Jungyoup Yang
Kwanghyun Won
m16209
Byeungwoo Jeon
Su Nyeon Kim
Motion Vector Coding with Optimal Predictor
m16210
Pierrick Philippe on behalf of
the FRNB
FRNB Comment on Video Coding
Sinwook Lee
Sowon Kim
m16211
Jeonghwan Ahn
Euee S. Jang
Source code for Interpolation Compression for MPEP-4
part 25
m16212 Thomas Davies
BBC 1080p50 test materials for HVC study
m16213 Korea National Body
Late KNB comment on 14496-20 PDAM2
Yongwei Zhu
Susanto Rahardja
m16214
Te Li
Haibin Huang
A proposal for streaming support for IM AF
m16215 Japan National Body
JNB Comment on High Performance Video Coding
Activity
m16216 P. Philippe
Subjective Evaluation of Low Delay MPEG SAOC
m16217
GRN Consortium via SC 29
Secretariat
Liaison Statement from GRN Consortium
m16218
IEC TC 100 via SC 29
Secretariat
IEC CDV 60958-3/Amd.1
Kohtaro Asai
m16219 Ryuta Suzuki
Shun-ichi Sekiguchi
Status of potential test materials for HVC with 4K or
higher resolutions
werner bailer
peter shallauer
m16220 mathias lux
walter allasia
francesco gallo
Proposal for APIs of image and video metadata on
MXM APIs
m16221 Sanghyun Joo
Disposition of MPEG RoSE within MPEG-V
m16222
Next generation Broadcasting
Forum(Korea)
Proposed Corrigendum on ISO/IEC 23000-11
Stereoscopic Video Application Format
m16223
JTC 1 SGSN via SC29
Secretariat
Liaison Statement from JTC 1 Study Group on Sensor
Network to SC 29
m16224
Ken McCann
Woo-Jin Han
Proposal on Focus for MPEG HVC standard
development
44
Jason Suh
m16225
Miska M. Hannuksela
Stefan Döhla
Miscellaneous comments on ISO/IEC 14496-12:2008
FPDAM1
m16226
ISO TC 223 via SC 29
Secretariat
Liaison Statement from ISO TC 223
m16227 SC 29 Secretariat
Informal Workshop on Videosurveillance
m16228 Mohamad Raad for AUNB
Comment on the discontinuation of JVT
Kohtaro Asai
m16229 Takuyo Kogure
Hiroshi Yasuda
Report of MPEG 20th anniversary commemoration
event
m16230 Jean Le Feuvre
FNB Comment on LASeR PDAM2
m16231 EDItEUR via SC 29 Secretariat
Liaison Statement from EDItEUR
Leon Denis
m16232 Dan Cernea
Munteanu
Updates concerning the MeshGrid compression
software
m16233
ITU-T SG 16 via SC 29
Secretariat
Liaison Statement from ITU-T SG 16
m16234 Gero Bäse for GNB
GNB on JVT matters
Gary J. Sullivan
Jens-Rainer Ohm
m16235
Thomas Wiegand
Ajay Luthra
Meeting Report of the 30th JVT Meeting (29 January 2 February 2009, Geneva, CH)
Gary J. Sullivan
Jens-Rainer Ohm
m16236
Thomas Wiegand
Ajay Luthra
Revised meeting report of the 27th JVT meeting (April
2008)
45
Annex D – Output documents
Number
Source
Title
w10311 Convener
List of Documents from the 87th Meeting in Lausanne, Switzerland
w10312 Convener
Resolutions of the 87th Meeting in Lausanne, Switzerland
w10313 Convener
List of AHGs Established at the 87th Meeting in Lausanne, Switzerland
w10314 Convener
Report of the 87th Meeting in Lausanne, Switzerland
w10315 Convener
Guidelines for Electronic Distribution of MPEG M and N Documents
w10316 Convener
Press Release of the 87th Meeting in Lausanne, Switzerland
w10317 Convener
Meeting Notice of the 88th Meeting in Maui, US
w10318 Convener
Guide for WG 11 Meeting Hosts
w10319 Convener
Liaison Statement to ITU-T SG 16
w10320 3DGC
DOCR on ISO/IEC 14496-4:2004/FPDAM 33 (Multi Resolution Profile
Conformance)
w10321 3DGC
DOCR on ISO/IEC 14496-4:2004/FPDAM 34 (3D Graphics Model
Conformance)
w10323 3DGC
DOCR on ISO/IEC 14496-5:2001/FPDAM 22 (3DGCM Reference
Software)
w10324 3DGC
Text of ISO/IEC 14496-5:2001/FDAM 22 (3DGCM Reference Software)
w10325 3DGC
Text ISO/IEC 14496-5:2001/FPDAM 25 (Scene Partitioning Reference
Software)
w10326 3DGC
Request for AMD
w10327 3DGC
WD1.0 of ISO/IEC 14496-5:2001/AMD 27 (SC3DMC RefSoft)
w10328 3DGC
Answer to liaison from W3D
w10329 3DGC
Text of ISO/IEC 14496-16:2006/PDAM4 (Scalable Complexity 3D Mesh
Compression)
w10330 3DGC
CE on Scalable Complexity 3D Mesh Compression
w10331 3DGC
WD of ISO/IEC 14496-16 3rd Edition
w10332 3DGC
Request for subdivision of ISO/IEC 14496-27
w10333 3DGC
Text of ISO/IEC 14496-27:200x/FDIS (3DG Conformance)
w10334 3DGC
Text of ISO/IEC 14496-27:200x/FPDAM1 (Scene partitioning
conformance)
w10335 3DGC
Text of ISO/IEC 14496-27:2009/PDAM2 (SC3DMC Conformance)
w10336 Convener
AHG on 3DGC documents, software maintenance and core experiments
w10337 Video
Disposition of Comments on ISO/IEC 14496-4:2004/PDAM38
46
w10338 Video
Text of ISO/IEC 14496-4:2004/FPDAM38 Multiview Video Coding
Conformance Testing
w10339 Video
Disposition of Comments on ISO/IEC 14496-5:2001/PDAM 15
w10340 Video
Text of ISO/IEC 14496-5:2001/FPDAM 15 Reference Software for
Multiview Video Coding
w10341 Video
Disposition of Comments on ISO/IEC 14496-10:200X/PDAM 1
w10342 Video
Text of ISO/IEC 14496-10:200X/FPDAM 1 Constrained Baseline Profile
and supplemental enhancement information
w10343 Video
Defect Report on ISO/IEC 14496-10:200X
w10344 Video
Working Draft 1 of ISO/IEC 14496-10:200X/Amd.2 Multiview Field High
Profile
w10345 Video
Description of Core Experiments in Video Signature Description
development
w10346 Video
Request for ISO/IEC 23000-3/Amd.2
w10347 Video
Text of ISO/IEC 23000-3/PDAM2 Conformance Testing for Photo Player
MAF
w10348 Video
Disposition of Comments on ISO/IEC FCD 23001-4
w10349 Video
Text of ISO/IEC FDIS 23001-4 Codec Configuration Representation
w10350 Video
Disposition of Comments on ISO/IEC FCD 23002-4
w10351 Video
Text of ISO/IEC FDIS 23002-4 Video Tool Library
w10352 Video
Request for ISO/IEC 23002-4/Amd.1
w10353 Video
Text of ISO/IEC 23002-4/PDAM1 Video Tool Library Conformance and
Reference Software
w10354 Video
WD 4 of ISO/IEC 23002-4/Amd.2 (Tools for MPEG-2 MP, MPEG-4 ASP,
AVC HP and SVC)
w10355 Video
RVC Work Plan and FU Development Status
w10356 Video
Description of Core Experiments in RVC
w10357 Video
Vision on 3D Video Coding
w10358 Video
Applications and Requirements of 3D Video Coding
w10359 Video
Call for 3D Test Material: Depth Maps & Supplementary Information
w10360 Video
Description of Exploration Experiments in 3D Video Coding
w10361 Video
Vision and Requirements for High-Performance Video Coding (HVC)
w10362 Video
Call for Test Materials for High-Performance Video Coding
Standardisation
w10363 Video
Draft Call for Evidence on High-Performance Video Coding
w10364 Convener
Liaison statement to ITU-R SG6 re Multi-view Video Coding
47
w10365 Convener
Response to National Bodies
w10366 Convener
Liaison statement to ITU-T SG16 re interlaced Multi-view Video Coding
w10367 Convener
AHG on Maintenance of MPEG-4 Visual related Documents, Reference
Software and Conformance
w10368 Convener
AHG on Reconfigurable Video Coding
w10369 Convener
AHG on MPEG-7 Visual
w10370 Convener
AHG on 3D Video Coding
w10371 Convener
AHG on High-Performance Video Coding
w10372 Convener
AHG on AVC Development
w10373 Audio
DoC on ISO/IEC 13818-4:2004/AMD 2:2005/DCOR 2, AAC
Conformance
w10374 Audio
ISO/IEC 13818-4:2004/AMD 2:2005/Cor 2, AAC Conformance
w10375 Audio
DoC on ISO/IEC 13818-7:2006/DCOR 1, AAC
w10376 Audio
ISO/IEC 13818-7:2006/Cor. 1, AAC
w10377 Audio
DoC on ISO/IEC 14496-3:2005/DCOR. 6, AAC
w10378 Audio
ISO/IEC 14496-3:2005/Cor. 6, AAC
w10379 Audio
DoC on ISO/IEC 14496-3:2005/AMD 2:2006/DCOR 4, HE-AAC V2
Profile and ALS
w10380 Audio
ISO/IEC 14496-3:2005/AMD 2:2006/Cor. 4, HE-AAC V2 Profile and
ALS
w10381 Audio
DoC on ISO/IEC 14496-3:2005/AMD 3:2006/ DCOR 2, SLS
w10382 Audio
ISO/IEC 14496-3:2005/AMD 3:2006/Cor. 2, SLS
w10383 Audio
DoC on ISO/IEC 14496-3:2005/AMD 9:2008/DCor. 1, AAC-ELD
w10384 Audio
ISO/IEC 14496-3:2005/AMD 9:2008/Cor. 1, AAC-ELD
w10385 Audio
ISO/IEC 14496-3:2009/FPDAM 1:200X HD-AAC Profile
w10386 Audio
Thoughts on MPEG Surround Signaling
w10387 Audio
ISO/IEC 14496-4:2004/Cor. 6, AAC-LD
w10388 Audio
ISO/IEC 14496-4:2004/DCOR 7, Removal of Audio and 3DG
Conformance
w10389 Audio
DoC on ISO/IEC 14496-4:2004/AMD13:200x/DCOR 2, AAC-LD
bitstreams
w10390 Audio
ISO/IEC 14496-4:2004/AMD13:200x/Cor. 2, AAC-LD bitstreams
w10391 Audio
DoC on ISO/IEC 14496-4:2004/PDAM 36, AAC-ELD, OAFI and
additional AAC Conformance
w10392 Audio
DoC on ISO/IEC 14496-5:2001/Amd.10:2007/DCOR 3, ALS and SLS
48
w10393 Audio
ISO/IEC 14496-5:2001/Amd.10:2007/COR 3, ALS and SLS
w10394 Audio
Request for Subdivision of 14496, Audio Conformance
w10395 Audio
ISO/IEC 14496-26:2009, Audio Conformance
w10396 Audio
ISO/IEC 14496-26:2009/DCOR 1, ALS and SLS updates
w10397 Audio
ISO/IEC 14496-26:2009/FPDAM 1, AAC-ELD, OAFI, additional AAC
and MPEG 1/2 on MPEG-4 Conformance
w10398 Audio
WD on additional BSAC conformance streams for broadcasting
w10399 Audio
DoC on ISO/IEC TR 15938-8:2002/PDAM 4, Extraction of audio features
from compressed formats
w10400 Convener
Terms of reference
w10401 Convener
MPEG Standards
w10402 Convener
Unpublished standards at FDIS level
w10403 Convener
MPEG work plan and time line
w10404 Convener
MPEG Standard Editors
w10405 Convener
Schema assets updates
w10406 Convener
Software assets
w10407 Convener
Conformance assets
w10408 Convener
Content assets
w10409 Convener
URI assets
w10410 Convener
Standards under development for which a call for patent statements is
issued
w10411 Convener
List of Organisations with which MPEG entertains liaisons
w10412 Convener
The MPEG Vision
w10413 Audio
ISO/IEC TR 15938-8:2002/DAM 4, Extraction of audio features from
compressed formats
w10414 Audio
ISO/IEC 23003-1:2007/DCOR 2, Misc. Corrections
w10415 Audio
ISO/IEC 23003-1:2007/AMD 2:2008/DCOR 1, Ref. Sw. Update
w10416 Audio
Study on ISO/IEC FCD 23003-2:200x, Spatial Audio Object Coding
w10417 Audio
Status and Workplan on SAOC Core Experiments
w10418 Audio
WD2 of USAC
w10419 Audio
Workplan for USAC CEs
w10420 Audio
MPEG Reference Encoder and the Audio CE Process
w10421 Audio
Workplan on MPEG Reference Encoder
w10422 Audio
Draft Revisions to MPEG Audio CE methodology
49
w10423 Audio
Thoughts on Efficient Bitstream Syntax
w10424 Convener
Response to DRM on MPEG-4 AAC Technology and Profiles
w10425 Convener
Response to ETSI/EBU/CENELEC JTC on MPEG-4 AAC Technology
and Profiles
w10426 Convener
Response to WorldDMB Forum on MPEG-4 AAC Technology and
Profiles
w10427 Convener
Response to IEC TC100/TA4 on IEC CDV 61937-11 and 60958-3/Amd.1
w10428 Convener
Response to AUNB Comments on USAC
w10429 Convener
Response to AUNB Comments on MetaData
w10430 Convener
Response to FR, FI and CN NB Comments on USAC
w10431 Convener
AHG on Audio Standards Maintenance
w10432 Convener
AHG on SAOC, USAC and MetaData
w10433 3DGC
Request for Amendment: 14496-27:2009/PDAM2 (SC3DMC
Conformance)
w10434 Audio
Issues concerning frame lengths in the AAC family profiles
w10435 Systems
DoC on ISO/IEC 13818-1:2007/PDAM4 Transport of MVC
w10436 Systems
Text of ISO/IEC 13818-1:2007/FPDAM4 Transport of MVC
w10437 Systems
WD 1.0 of ISO/IEC 13818-1:2007 DCOR X
w10438 Systems
Text of ISO/IEC 14496-4:2004/FPDAM 37 File Format Conformance
Improvements
w10439 Systems
WD 1.0 of ISO/IEC 14496-11:2002/AMD X New BIFS profile
w10440 Systems
DoC on ISO/IEC 14496-12:200X/DCOR 2 Usage of brands and box order
in sample entry
w10441 Systems
Text of ISO/IEC 14496-12:200X/COR 2 Usage of brands and box order in
sample entry
w10442 Systems
Study of ISO/IEC 14496-12:200X/FPDAM 1 General Improvements
w10443 Systems
Text of ISO/IEC 14496-15:2004/COR3
w10444 Systems
DoC on ISO/IEC 14496-15:2004/PDAM 3 MVC File Format
w10445 Systems
Text of ISO/IEC 14496-15:2004/FPDAM 3 MVC File Format
w10446 Systems
DoC on ISO/EC 14496-20:2008/PDAM 2 Adaptation
w10447 Systems
Text of ISO/EC 14496-20:2008/FPDAM 2 Adaptation
w10448 Systems
Workplan for service example of LASeR Adaptation & PSI
w10449 Systems
Clarification on the usage of ISO/IEC 14496-20 by other standardization
bodies
w10450 Systems
DoC on ISO/IEC 14496-22 FCD 2nd Edition
50
w10451 Systems
Text of ISO/IEC 14496-22 FDIS 2nd Edition
w10452 Systems
Text of ISO/IEC 15938-12:2008 /COR 1
w10453 Systems
WD2.0 of ISO/IEC 21000-2 AMD PSI
w10454 Systems
Draft DoC on ISO/IEC CD 21000-19 Media Value Chain Ontology
w10455 Systems
Draft Text of ISO/IEC FCD 21000-19 Media Value Chain Ontology
w10456 Systems
DoC on ISO/IEC 23000-6 PA-AF/PDAM 1 Conformance and Reference
Software
w10457 Systems
Text of ISO/IEC 23000-6 PA-AF/FPDAM 1 Conformance and Reference
Software
w10458 Systems
Workplan for ISO/IEC 23000-6 PA-AF Conformance and Reference
Software
w10459 Systems
Text of ISO/IEC 23000-9:2008/FPDAM1 DMB AF Conf. And Ref. Soft.
w10460 Systems
Workplan for DMB AF Conf. And Ref. Soft.
w10461 Systems
Request for ISO/IEC 23000-9:2008/PDAM2 DMB AF Harmonization of
MPEG-2 TS storage
w10462 Systems
Text of ISO/IEC 23000-9:2008/PDAM2 DMB AF Harmonization of
MPEG-2 TS storage
w10463 Systems
DoC on ISO/IEC 23000-10 PDAM1 Video Surveillance Application
Format Conf. & Ref. SW.
w10464 Systems
Text of ISO/IEC 23000-10 FPDAM1 Video Surveillance Application
Format Conf. & Ref. SW.
w10466 Systems
WD 1.0 of ISO/IEC 23000-11/AMD1 Stereoscopic Video Application
Format Conf. & Ref. SW.
w10467 Systems
Text of ISO/IEC 23000-11/DCOR1 (SVAF signalling of voice codecs)
w10468 Systems
Text of ISO/IEC 23000-12 CD Interactive Music AF
w10469 Systems
Proposal for new work item
w10470 Systems
Text of ISO/IEC 23006-1 CD MxM Architecture and Technologies
w10471 Systems
Text of ISO/IEC 23006-2 CD MXM APIs
w10472 Systems
Text of ISO/IEC 23006-3 CD MXM Conf. & Ref. SW
w10473 Systems
Text of ISO/IEC 29116-1 2nd edition MXM Protocols
w10474 Systems
WD of Architecture
w10475 Systems
WD of Sensory Information
w10476 Systems
WD of Avatar Information
w10477 Systems
WD of Control Information
w10478 Systems
Ideas on the new AIT project
w10479 Systems
Ideas on How to Implement Collaboration Between MPEG and ITU-T
51
Q.13/SG16 on the Advanced IPTV Terminal Standardisation
w10480 Systems
MPEG Schema Assets Updates
w10481 Systems
MPEG URIs and MIME Types
w10482 Convener
Liaison statement to W3C on MXM
w10483 Convener
Liaison statement to WG 1 on PA-AF
w10484 Convener
Liaison statement to ITU-T SG16 on IPTV
w10485 Convener
Liaison statement to OMA BCAST on ISO/IEC 14496-20
w10486 Convener
Liaison statement to IEC TC 100 on HD Recorder/Receiver Interface
w10487 Convener
Liaison statement to IEC TC 100 on IP & TS based service acess
w10488 Convener
Liaison statement to IEC TC 100 on digital right permission code
w10489 Convener
Liaison statement to JTC 1 Study Group on Sensor Network
w10490 Convener
Liaison statement to ISO TC 223 on Video Surveillance
w10491 Convener
Liaison statement to FNB on Informal workshop on Video Surveillance
w10492 Convener
Liaison statement to SC27 on new work item regarding digital evidence
w10493 Convener
Liaison statement to EDItEUR on MVCO
w10494 Convener
Liaison statement to IPFI on MVCO
w10495 Convener
Liaison statement to DOI on MVCO
w10496 Requirements MPEG Modern Transport (MMT) over Networks
w10497 Requirements Draft Advanced IPTV Terminal (AIT) Requirements
w10498 Requirements Requirements for MPEG-V Version 3.2
w10499 Systems
Text of ISO/IEC 14496-12:2008/DCOR 3
w10500 Systems
Request for ISO/IEC 14496-14:2003/PDAM1 Handling of MPEG-4 Audio
Enhancement Layers
w10501 Systems
Text of ISO/IEC 14496-14:2003/PDAM1 Handling of MPEG-4 Audio
Enhancement Layers
w10502 Convener
Liaison Statement to SMPTE 23B/Container on Package Formats
w10503 Requirements
Requirements v2.0 for a new BIFS profile to support Interactive Digital
Radio
w10504 Convener
Liaison statement to WorldDMB on new BIFS profile
w10505 Convener
Liaison statement to GRN on new BIFS profile
w10506 Convener
Liaison statement to TTA on new BIFS profile
w10507 Convener
List of identified non MPEG members to be allowed to access MPEG SVN
repository
w10508 Convener
AHG on Scene Representation
52
w10509 Convener
AHG on MPEG File Formats
w10510 Convener
AHG on Application Format
w10511 Convener
AHG on MVCO - ignore
w10512 Convener
AHG on MPEG-V (including previous RoSE activities)
w10513 Convener
AHG on Font Format Representation
w10514 Convener
AHG on Advanced IPTV Terminal
w10515 Convener
AHG on MXM
w10516 Convener
Liaison Statement: frame lengths in the AAC family profiles
53
Annex E – Requirements report
Source: Jörn Ostermann (Leibniz Universität Hannover)
1
Requirements documents approved at this meeting
No.
10357
10358
10359
10360
10361
10362
10363
10496
10497
10498
10503
2
Title
Vision on 3D Video Coding
Applications and Requirements of 3D Video Coding
Call for 3D Test Material: Depth Maps & Supplementary Information
Description of Exploration Experiments in 3D Video Coding
Vision and Requirements for High-Performance Video Coding (HVC)
Call for Test Materials for High-Performance Video Coding
Standardisation
Draft Call for Evidence on High-Performance Video Coding
MPEG Modern Transport (MMT) over Networks
Draft Advanced IPTV Terminal (AIT) Requirements
Requirements for MPEG-V Version 3.2
Requirements v2.0 for a new BIFS profile to support Interactive Digital
Radio
Multiview Video Coding (MVC)
In response to resolution 3.5.4 of the 86th WG11 meeting, MPEG received input from three National
Bodies requesting support for interlaced video in multiview video coding. Since companies also
promised to provide software and to support the necessary experiments, a WD for a new profile
MVC interlaced was started.
3
MPEG-V: Information exchange with virtual worlds
In response to the Call for Proposals (N10239), MPEG received four contributions. One contribution
provided significant technical detail to start the work. There was no input related to command and
control for MPEG-V. However, a new call for proposals is not necessary and MPEG started defining
the working drafts of the four parts of MPEG-V. Requirements for MPEG-V Version 3.2 were
established (N10498).
4
MAF
The main goal of MAFs is to define the collection of MPEG standards that enables the deployment
of an application. As such, a MAF needs to be focused, should show a demonstration of the
application, requires continuous input over the development time of the MAF and strong industry
support. The MAF Overview document (N10233/N10234) has not been updated.
After more than two years of inactivity, MPEG received again input on the topic of Advanced
Surveillance AF. Input document M16159 Extended template for the Advanced Surveillance AF with
more than 20 authors from mainly academic organisations provides clarified requirements with
reference to an old version of the MAF Overview document. Technology from MPEG-4, MPEG-7,
and MPEG-21 is required. The proposers are encouraged to show a demonstration and more support
from industry.
54
5
Video Signature Tools
In response to the call for video signature tools, MPEG received satisfactory input from five
organisations. The core experiment process was started by the video group.
6
Explorations
6.1
Exploration on MPEG User Interface Framework
The response to the call for proposals N10232 is expected for the 88th meeting.
6.2
Interactive Radio
Requirements for Interactive Radio were updated as captured in N10503 Requirements v2.0 for a
new BIFS profile to support Interactive Digital Radio. Furthermore, input for Requirements M16179
Comments on requirements for a new BIFS profile was processed. It appears that there was sufficient
technology available to MPEG such that there was no need to issue a Call for Proposals at the 87th
meeting. Instead, a working draft (N10439) was issued.
6.3
3D Video Coding
Following the vision to enable both advanced stereoscopic display processing and improved support
for auto-stereoscopic N-view displays as outlined in N10357 Vision on 3D Video Coding, MPEG
issued N10359 Call for 3D Test Material Depth Maps & Supplementary Information and N10360
Description of Exploration Experiments in 3D Video Coding. Work on applications and requirements
for 3D Video coding started (N10358). Applications are to be defined at the 88th meeting. The vision
does not outline a time line since the time line depends on the technology available to MPEG.
6.4
High Performance Video Coding (HVC)
HVC targets mobile services, IPTV, and Ultra High Definition (UHD) displays with a focus on
coding efficiency considering codec complexity as well. The current target is to increase coding
efficiency by 25% at low complexity and 50% at full complexity. MPEG foresees that the reduction
of complexity will be achieved by turning off some tools required to reach the full performance in
terms of coding efficiency.
Another Call for high quality test material (N10362) and a Draft call for evidence on high
performance video coding (N10363) was issued. Evaluation will take place prior to the 89th meeting.
Draft requirements are captured in Vision and Requirements for High-Performance Video Coding
(HVC) (N10361) which was further developed using input from M16207 Requirements fir highperformance video standards.
55
Bit Rate
Simulcast
3DV should be compatible with:
• existing standards
• mono and stereo devices
• existing or planned infrastructure
MVC
3DV
2D+Depth
2D
3D Rendering Capability
Figure 1: Envisioned performance of 3D Video Coding with respect so existing solutions.
Figure 2: Use case for Ultra High Definition Displays targeted by HVC.
6.5
Audio coding for HVC
With the use of ultra high definition displays, the appropriate audio environment has to be considered.
Shared (Figure 2) and individual UHD (Figure 3)experiences with a viewing distance of 50 cm
should be considered. The audio group is encouraged to evaluate current standards and their
suitability for HVC which will require the precise localisation of sound sources by the listeners and
might have to consider the current head position of the listener as well.
56
Figure 3: Displays with associated speakers
6.6
Advanced IPTV Terminal
Based on M16188 Proposal of Advanced IPTV Terminal (AIT) requirements, N10497 Draft
Advanced IPTV Terminal (AIT) Requirements was developed. It is envisioned that a joint project
with ITU-T Q13./SG16 could be started to define the architecture and technologies for an IPTV
terminal.
6.7
Modern MPEG Transport
Currently, MPEG standardizes bitstreams and IETF provides protocols for their transport. While
MPEG develops standards having error resilience and IP-networks in mind, a joint optimization of
coding and transport for IP networks is not done jointly by experts of MPEG and IETF.
According to N10496 MPEG Modern Transport (MMT) over Networks there is a need for a transport
and file format friendly stream format. Error resilience of current MPEG streams might not be
optimal. The potential gains of joint optimization of coding and transport are not known.
Conversions between different transport mechanisms like from MPEG-2 Transport Stream to MPEG
Program Stream are not straight forward or defined. Furthermore, MPEG does not provide any hint
on how to adapt content to different networks.
An extension and review of the issues is sought in order to determine a potential need for
contributions of MPEG and an adaptation of its internal standards development process.
57
Annex F – Systems report
Source: Young-Kwon Lim, Chair
1
List of output documents
The main outputs of the meeting from the Systems Sub-group perspective are:
No.
Title
X
10435
10436
10437
13818-1 MPEG-2 Systems
DoC on ISO/IEC 13818-1:2007/PDAM4 Transport of MVC
Text of ISO/IEC 13818-1:2007/FPDAM4 Transport of MVC
WD 1.0 of ISO/IEC 13818-1:2007 DCOR X
14496-4 Conformance
Text of ISO/IEC 14496-4:2004/FPDAM 37 File Format Conformance
Improvements
14496-11 BIFS
WD 1.0 of ISO/IEC 14496-11:2002/AMD X New BIFS profile
14496-12 ISO File Format
DoC on ISO/IEC 14496-12:200X/DCOR 2 Usage of brands and box order in
sample entry
Text of ISO/IEC 14496-12:200X/COR 2 Usage of brands and box order in
sample entry
Study of ISO/IEC 14496-12:200X/FPDAM 1 General Improvements
10438
X
10439
X
10440
10441
10442
10499 Text of ISO/IEC 14496-12:2008/DCOR 3
14496-14 MP4 File Format
10500 Request for ISO/IEC 14496-14:2003/PDAM1 Handling of
MPEG-4 Audio enhancement layers
10501 Text of ISO/IEC 14496-14:2003/PDAM1 Handling of MPEG-4
Audio enhancement layers
10443
10444
14496-15 AVC File Format
Text of ISO/IEC 14496-15:2004/COR3
DoC on ISO/IEC 14496-15:2004/PDAM 3 MVC File Format
10445 Text of ISO/IEC 14496-15:2004/FPDAM 3 MVC File Format
X
14496-20 LASeR
10446
10447
10448
10449
DoC on ISO/EC 14496-20:2008/PDAM 2 Adaptation
Text of ISO/EC 14496-20:2008/FPDAM 2 Adaptation
Workplan for service example of LASeR Adaptation & PSI
Clarification on the usage of ISO/IEC 14496-20 by other
standardization bodies
X
14496-22 Open Font Format
10450 DoC on ISO/IEC FCD 14496-22 2nd Edition
10451 Text of ISO/IEC FDIS 14496-22 2nd Edition
X
Available
No
No
No
06/02/09
20/02/09
06/02/09
No
06/02/09
No
06/02/09
No
06/02/09
No
06/02/09
No
No
23/02/09
06/02/09
No
06/02/09
No
06/02/09
No
No
06/02/09
06/02/09
No
23/02/09
No
Yes
No
Yes
06/02/09
27/02/09
06/02/09
06/02/09
No
No
06/02/09
27/03/09
No
06/02/09
No
06/03/09
No
No
06/02/09
06/03/09
15938-12 MPEG Query Format
10452 Text of ISO/IEC 15938-12:2008 /COR 1
X
10453
X
10454
10455
X
TBP
21000-2 Digital Item Declaration
WD2.0 of ISO/IEC 21000-2 AMD PSI
21000-19 Media Value Chain Ontology
Draft DoC on ISO/IEC CD 21000-19 Media Value Chain Ontology
Draft Text of ISO/IEC FCD 21000-19 Media Value Chain Ontology
23000-6 Professional Archival Application Format
58
10456 DoC on ISO/IEC 23000-6 PA-AF/PDAM 1 Conformance and
Reference Software
10457 Text of ISO/IEC 23000-6 PA-AF/FPDAM 1 Conformance and
Reference Software
No
06/02/09
No
06/02/09
10458
X
10459
10460
10461
No
06/02/09
No
No
No
06/02/09
06/02/09
06/02/09
No
06/02/09
No
06/02/09
No
06/16/09
No
06/02/09
No
06/02/09
No
06/02/09
No
No
No
No
06/02/09
27/02/09
27/02/09
27/02/09
No
No
No
No
No
06/02/09
20/03/09
20/03/09
20/03/09
06/02/09
No
27/03/09
Ideas on the new project
Ideas on How to Implement Collaboration Between MPEG and ITU-T
Q.13/SG16 on the Advanced IPTV Terminal Standardisation
Assets and Standing Documents
MPEG Schema Assets Updates
MPEG URIs and MIME Types
Liaison
No
No
06/02/09
06/02/09
No
No
06/02/09
06/02/09
Liaison statement to W3C on MXM
Liaison statement to WG 1 on PA-AF
Liaison statement to ITU-T SG16 on IPTV
Liaison statement to OMA BCAST on ISO/IEC 14496-20
Liaison statement to IEC TC 100 on HD Recorder/Receiver
Interface
Liaison statement to IEC TC 100 on IP & TS based service acess
Liaison statement to IEC TC 100 on digital right permission code
Liaison statement to JTC 1 Study Group on Sensor Network
Liaison statement to ISO TC 223 on Video Surveillance
No
No
No
No
No
06/02/09
06/02/09
06/02/09
06/02/09
06/02/09
No
No
No
No
06/02/09
06/02/09
06/02/09
06/02/09
10462
X
10463
10464
X
10466
10467
X
10468
X
10474
10475
10476
10477
X
10469
10470
10471
10472
10507
X
10473
X
10478
10479
X
10405
10409
X
10482
10483
10484
10485
10486
10487
10488
10489
10490
Workplan for ISO/IEC 23000-6 PA-AF Conformance and Reference Software
23000-9 Digital Multimedia Broadcasting Application Format
Text of ISO/IEC 23000-9:2008/FPDAM1 DMB AF Conf. And Ref. Soft.
Workplan for ISO/IEC 23000-9:2008/FPDAM1 DMB AF Conf. And Ref. Soft.
Request for ISO/IEC 23000-9:2008/PDAM2 DMB AF Harmonization of
MPEG-2 TS storage
Text of ISO/IEC 23000-9:2008/PDAM2 DMB AF Harmonization of MPEG-2
TS storage
23000-10 Video Surveillance Application Format
DoC on ISO/IEC 23000-10 PDAM1 Video Surveillance Application Format
Cof. & Ref. SW.
Text of ISO/IEC 23000-10 FPDAM1 Video Surveillance Application Format
Cof. & Ref. SW.
23000-11 Stereoscopic Video Application Format
WD 1.0 of ISO/IEC 23000-11/AMD1 Stereoscopic Video Application Format
Conf. & Ref. SW.
Text of ISO/IEC 23000-11/DCOR1 SVAF signalling of voice codecs
23000-12 Interactive Music AF
Text of ISO/IEC CD 23000-12 Interactive Music AF
23005 – MPEG-V
WD of Architecture
WD of Sensory Information
WD of Avatar Information
WD of Control Information
23006 – MPEG eXtensible Middleware
Proposal for new work item
Text of ISO/IEC CD 23006-1 MxM Architecture and Technologies
Text of ISO/IEC CD 23006-1 MXM APIs
Text of ISO/IEC CD 23006-1 MXM Conf. & Ref. SW
List of identified non MPEG members to be allowed to access MPEG SVN
repository
Supplemental Media Technologies – MPEG eXtensible Middleware
Text of ISO/IEC CD 29116-1 2nd edition MXM Protocols
Exploration – Advanced IPTV Terminal
59
10491
10492
10493
10494
10495
10502
10504
10505
10506
Liaison statement to FNB on Informal workshop on Video
Surveillance
Liaison statement to SC27 on new work item regarding digital
evidence
Liaison statement to EDItEUR on MVCO
Liaison statement to IPFI on MVCO
Liaison statement to DOI on MVCO
Liaison statement to SMPTE 23B/Container on package formats
Liaison statement to WorldDMB on new BIFS profile
Liaison statement to GRN on new BIFS profile
Liaison statement to TTA on new BIFS profile
60
No
06/02/09
No
06/02/09
No
No
No
No
No
No
No
06/02/09
06/02/09
06/02/09
06/02/09
06/02/09
06/02/09
06/02/09
2
General issues
2.1
List of standards under development
Pr Pt
2 1
4 1
4
4
4
4
4
4
4
4
4
4
4
7
21
A
A
A
A
A
A
A
A
A
A
B
E
M
M
M
V
V
V
V
Edit. Project
2007 AMD4
200x AMD4
4 2004 AMD37
5 2007 AMDxx
5 2007 AMDxx
5 2007 AMD23
12 200x AMD1
12 200x COR2
15 200x COR3
15 200x AMD3
20 200X AMD2
20 200X AMD3
22 2008 2nd Ed.
12 2008 COR1.
19 200x 1st Ed.
4 200x AMD2
5 200x 2nd Ed.
6 200x AMD1
8 200x AMD1
9 200x AMD1
9 200x AMD2
10 200x AMD1
11 200x 1st Ed.
11 200x COR1
12 200x 1st Ed.
2 200x AMD1
1 200x 2nd Ed.
8 200x 1st Ed.
1 200x 1st ed.
2 200x
200x
3 200x
200x
1 200x
200x
2 200x
200x
3 200x
200x
4 200x
200x
Description
Transport of MVC
-RA
-Use of LASeR in MPEG-2 &
MPEG-4 Systems
File Format Conf.
AVC File Format Ref. Soft
SVC File Format Ref. Soft
Synth. Texture Ref. Soft
Misc. Addition to FF
Brands & box orders
Minor corrections to AVC FF
MVC File Format
Scene Adaptation
PSI
Open Font Format
MPQF minor corrections
Media Value Chain Onto.
Prot. MSSAF Conf. & Soft
MS AF
PA-AF Conf. & Ref. SW
PVP AF Soft. And Conf.
DMB AF Soft. And Conf.
DMB AF MPEG-2 Storage
VS Conf. & Ref. SW
SVAF Ref. Soft. And Conf.
SVAF Voice codec signaling
Interactive Music AF
FRU Ref. Soft. And Conf.
MXM Protocols
Ref. Soft. and Conformance
MxM Architecture
MxM APIs
MxM Conf. & Ref. SW
Architecture
Sensory Information
Avatar Information
Control Interface
61
CfP
WD
CD
FCD FDIS
08/10 09/02 09/07
07/10
08/04 08/10
TBS
TBS
08/04
08/04
08/10
08/07
08/04 08/10
08/07
08/07 08/10
08/01
08/01
08/07 08/10
08/04
08/01 09/02
08/07
09/02 09/07
08/10 09/04
08/10 09/04
09/02
09/02
09/02 09/07
09/02 09/07
09/02 09/07
08/07 09/02
08/07 09/02
09/02 09/07
08/10 09/04
09/04 09/10
09/02 09/07
TBS
09/02 09/07 10/01
08/07 09/02 09/07
08/07 09/02 09/07
09/02
09/02 09/07 10/01
09/02 09/07 10/01
07/01
08/07
08/07
08/07
09/02
09/02
09/02
09/02
09/02
07/07
09/02
09/02
09/02
09/07
09/07
09/07
09/07
09/10
08/04
09/10
09/10
09/10
09/10
09/10
09/10
09/10
10/04
08/10
10/04
10/04
10/04
10/04
10/04
10/04
10/04
2.2
Standing Documents
Pr Pt
Documents
1
1 MPEG-1 White Paper – Multiplex Format
1
1 MPEG-1 White Paper – Terminal Architecture
1
1 MPEG-1 White Paper – Multiplexing and
Synchronization
2
1 MPEG-2 White Paper – Multiplex Format
2
1 MPEG-2 White Paper – Terminal Architecture
2
1 MPEG-2 White Paper – Multiplexing and
Synchronization
2 11 MPEG-2 White Paper – MPEG-2 IPMP
4
1 MPEG-4 White Paper – MPEG-4 Systems
4
1 MPEG-4 White Paper – Terminal Architecture
4
1 MPEG-4 White Paper – M4MuX
4
1 MPEG-4 White Paper – OCI
4
6 MPEG-4 White Paper – DMIF
4 11 MPEG-4 White Paper – BIFS
4 12 MPEG-4 White Paper – ISO File Format
4 14 MPEG-4 White Paper – MP4 File Format
4 15 MPEG-4 White Paper – AVC FF
4
4
4
4
4
4
4
7
7
21
A
A
A
B
E
E
E
No.
Meeting
N7675 05/07 Nice
N7676 05/07 Nice
N7677 05/07 Nice
N7678 05/07 Nice
N7679 05/07 Nice
N7680 05/07 Nice
N7503
N7504
N7610
N7921
N8148
N8149
N7608
N8150
N7923
N7924
05/07 Poznan
05/07 Poznan
05/10 Nice
06/01 Bangkok
06/04 Montreux
06/04 Montreux
05/10 Nice
06/04 Montreux
06/01 Bangkok
06/01 Bangkok
13
13
17
18
20
White Paper on MPEG-4 IPMP
MPEG IPMP Extensions Overview
White Paper on Streaming Text
White Paper on Font Compression and Streaming
Presentation Material on LASER
N7505
N6338
N7515
N7508
N6969
20
22
1
1
9
White Paper on LASeR
White Paper on Open Font Format
MPEG-7 White Paper - MPEG-7 Systems
MPEG-7 White Paper – Terminal Architecture
MPEG-21 White Paper – MPEG-21 File Format
N7507
N7519
N7509
N8151
N7925
05/07 Poznan
04/03 München
05/07 Poznan
05/07 Poznan
05/01 HongKong
05/07 Poznan
05/07 Poznan
05/07 Poznan
06/04 Montreux
06/01 Bangkok
X
X
X
X
X
MPEG Application Format Overview
MAF Overview Document
MAF Overview Presentation
MPEG-B White Paper – BinXML
MPEG Multimedia Middleware Context and
Objectives
1rst M3W White paper
2nd M3W White Paper : Architecture
N9421
N9840
N9841
N7922
N6335
07/10 Shenzhen
08/04 Archamps
08/04 Archamps
06/01 Bangkok
04/03 München
X
X
62
N7510 05/07 Poznan
N8152 06/04 Montreux
E
X
Tutorial on M3W
E
X
E
X
X
X
M3W White Paper : Multimedia Middleware
Architecture
M3W White Paper : Multimedia API
M3W White Paper : Component Model
M3W White Paper : Resource and Quality
Management
M3W White Paper : Component Download
M3W White Paper : Fault Management
M3W White Paper : System Integrity
Management
E
E
E
E
E
X
X
X
63
N8153 06/04 Monreux
N8687 06/10 Hanzhou
N8688 06/10 Hanzhou
N8689 06/10 Hanzhou
N8690 06/10 Hanzhou
N8691 06/10 Hanzhou
N8692 06/10 Hanzhou
N8693 06/10 Hanzhou
2.3
Mailing Lists Reminder
Topic
General Systems
List
File Format
LASeR
MAF
ISO File Format
Transport
AIT
Metaverse
RoSE
MXM
Information
Reflector : gen-sys@lists.uni-klu.ac.at
Subscribe: http://lists.uniklu.ac.at/mailman/listinfo/gen-sys
Archive: http://lists.uniklu.ac.at/mailman/private/gen-sys/
Reflector : mp4-sys@lists.uni-klu.ac.at
Subscribe: http://lists.uniklu.ac.at/mailman/listinfo/mp4-sys
Archive: http://lists.uniklu.ac.at/mailman/private/mp4-sys/
Reflector : mpeg-laser@lists.uni-klu.ac.at
Subscribe: http://lists.uniklu.ac.at/mailman/listinfo/mpeg-laser
Archive: http://lists.uni-klu.ac.at/pipermail/mpeglaser/
Reflector : maf-sys@lists.uni-klu.ac.at
Subscribe: http://lists.uniklu.ac.at/mailman/listinfo/maf-sys
Archive: http://lists.uniklu.ac.at/mailman/private/maf-sys/
Reflector: isoff-transport@lists.uni-klu.ac.at
Subscribe: http://lists.uniklu.ac.at/mailman/listinfo/isoff-transport
Archive: http://lists.uniklu.ac.at/mailman/private/isoff-transport/
Reflector: jiptv@lists.uni-klu.ac.at
Subscribe: http://lists.uniklu.ac.at/mailman/listinfo/jiptv
Archive: http://lists.uniklu.ac.at/mailman/private/jiptv/
Reflector: metaverse@lists.uni-klu.ac.at
Subscribe: http://lists.uniklu.ac.at/mailman/listinfo/metaverse
Archive: http://lists.uniklu.ac.at/mailman/private/metaverse/
Reflector: rose@lists.uni-klu.ac.at
Subscribe: http://lists.uniklu.ac.at/mailman/listinfo/rose
Archive: http://lists.uniklu.ac.at/mailman/private/rose/
Reflector: mxm@lists.uni-klu.ac.at
Subscribe: http://lists.uniklu.ac.at/mailman/listinfo/mxm
Archive: http://lists.uniklu.ac.at/mailman/listinfo/mxm
64
Kindly
Hosted by
Klagenfurt
University
Klagenfurt
University
Klagenfurt
University
Klagenfurt
University
Klagenfurt
University
Klagenfurt
University
Klagenfurt
University
Klagenfurt
University
Klagenfurt
University
2.4
2.4.1
Session
General
General Technical Issues
Contributions
Numbe
Title
r
m1595 Ad Hoc Group on Scene Representation
5
General
m1595
7
Ad Hoc Group on Application Format
General
m1595
1
m1595
8
Ad Hoc Group on MxM
m1595
4
m1595
9
m1595
6
m1596
0
m1602
9
m1597
0
m1597
2
m1597
5
m1600
7
M1622
3
Ad Hoc Group on Information Exchange
with Virtual Worlds
Ad Hoc Group on Advanced IPTV
Terminal
Ad Hoc Group on MPEG File Formats
General
General
General
General
General
General
General
General
General
General
General
General
General
M1622
6
M1622
7
Ad Hoc Group on the RoSE Framework
Ad Hoc Group on Font Format
Representation
Liaison Statement from IEC TC 100 [SC
29 N 10025]
Table of Replies on ISO/IEC FDIS 230008 [SC 29 N 9910]
Table of Replies on ISO/IEC 210007:2007/FDAM 1 [SC 29 N 9912]
IEC CDV 62546 [SC 29 N 9917]
(HD Recorder/receiver interface)
IEC CD 62455 [SC 29 N 9960]
(IP & TS based service access)
Liaison Statement from JTC 1 Study
Group on Sensor Network to SC
29
Liaison Statement from ISO TC 223
(Video surveillance)
Informal Workshop on Video Surveillance
Authors
Young-Kwon Lim,
Jaeyeon Song, Cyril
Concolato
Kyuheon Kim, Hui Yong
Kim, Jean Cha, Noboru
Harada, Hendry
Filippo Chiariglione,
Marius Preda
Sanghyun Joo, Jean
Gelissen, Christian
Timmerer
Jean Gelissen, Marius
Preda, Keiji Mitsubuchi
Xin Wang, Young Kwon
Lim
David Singer
Vladimir Levantovsky
IEC TC 100 via SC 29
Secretariat
ITTF via SC 29
Secretariat
ITTF via SC 29
Secretariat
IEC TC 100
IEC TC 100
JTC1 SGSN
TC 223
FNB
m15955, m15957, m15951, m15958, m15954, m15959, m15956, m15960
All AHG reports are reviewed. No specific comments are made.
m16007 Liaison informing CD of IEC 62455: Internet protocol (IP) and transport stream (TS)
based service access (TA1). Reply to thank you for the information and to provide MXM
standards.
65
m16029 Liaison informing the publication of the new IEC International Standard 62227
(IEC6222) Digital Rights Permission Code. Reply to thank you and to provide information
about REL.
m15975 Liaison informing CD of IEC 62546: HD Recording Link Guidelines. Reply to thank
you.
m16223 Liaison requesting contributions from us with information on your current work
(including the scope of your current projects) and the potential new areas for standardization
related to sensor networks in our field.
Reply to inform the context and objectives of MPEG-V and RoSE.
m16226 Liaison informing the proposed new work item regarding video surveillance format
interoperability. Reply to inform ISO base file format and Video Surveillance Application
Format
m16227 Liaison to invite to a Workshop on Digital Videosurveillance Format. Reply to thank
you.
2.4.2
Others
2.5
Demonstrations
None.
2.6
FAQ
The FAQ were updated as needed.
2.7
AOB
None.
66
3
3.1
MPEG-2 Systems (13818-1)
13818-1:2007 Amd.3 Carriage of SVC
3.1.1
Topics
1.
3.1.2
Sessio
n
Genera
l
Transport of Scalable Video Coding
Contributions
Numbe
Title
r
m1598 Table of Replies on ISO/IEC 138185
1:2007/FDAM 3
Authors
ITTF via SC 29
Secretariat
m15985 ISO/IEC 13818-1:2007 / FDAM3 is approved.
.
Work Completed
3.2
13818-1:2007 Amd.4 Transport of MVC
3.2.1
Topics
1.
3.2.2
Sessio
n
Genera
l
Transport of Multiview Video Coding
Contributions
Numbe
Title
r
m1598 Summary of Voting on ISO/IEC 138188
1:2007/PDAM 4
Authors
SC 29 Secretariat
m15988 One disapproval with comments from USNB.
Technical Work in Progress.
3.3
13818-1:2007 Cor.X
3.3.1
Topics
1.
3.3.2
Sessio
n
Genera
l
Potential Corrigendum
Contributions
Numbe
Title
r
m1611 Potential Corrigendum Items for MPEG-2
1
Systems
67
Authors
Tomoo Yamakage
m16111 This contribution proposes two potential corrigendum items regarding removal rate
from transport buffer and unit of VBV buffers size. Agreed to start a working draft at this
meeting
Technical Work in Progress.
4
4.1
MPEG-4 Conformance (14496-4)
14496-4 Amd.37 File Format Conformance Improvements
4.1.1
Topics
1.
4.1.2
Sessio
n
FF
File Format Conformance
Contributions
Numbe
Title
r
m1599 Summary of Voting on ISO/IEC 144960
4:2004/PDAM 37
Authors
SC 29 Secretariat
m15990 All approved.
Technical Work in Progress.
5
5.1
MPEG-4 BIFS (14496-11)
14496-11/AMD X Digital Radio Profile
5.1.1
Topics
1.
5.1.2
Sessio
n
Scene
Scene
Scene
Scene
Scene
Digital Radio Profile
Contributions
Numbe
r
m1619
4
m1600
5
m1621
7
m1617
9
m1605
Title
Authors
Liaison Statement from the World DMB
Forum
Liaison Statement from TTA [SC 29 N
9959]
Liaison Statement from GRN Consortium
Comments on requirements for a new BIFS
profile
Novel approaches to remote display
68
World DMB Forum via
SC 29 Secretariat
TTA via SC 29
Secretariat
GRN Consortium via SC
29 Secretariat
Cyril Concolato
Jean Le Feuvre
Benoit Pellan
Mihai MITREA
8
representations: BiFS-based solution and its
deployment within the FP7 MobiThin
project
Pieter SIMOENS
Bojan JOVESKI
Bert VANKEIRSBILCK
Abdeslam
TAGUENGAYTE
Françoise PRETEUX
m16058 This contribute introduces preliminary results of converting X11 commands to BIFS
commands. This can be used to people can access the screen of their computer from mobile
phone if the computer uses X11 commands. This is part of EU FP7 projects.
m16179 This contribution reports draft analysis on the list technologies to fulfill the
requirement prepared at the Busan meeting to support Interactive Digital Radio applications.
No solutions to support focus navigation, caching and scene state management are found.
Probably this might need verification experiment to measure required complexity to
implement.
m16194 This liaison proposes addition requirements to new BIFS profile under consideration
as follows:
 no modification to underlying delivery layers
 supporting all requirements that aim at extending the MPEG-4 BIFS Core2D@Level1
functionalities to either improve BIFS coding efficiency or minimize the use of coded
images (JPEG or PNG) by replacing them with graphics elements
 introducing mechanisms allowing the integration of DAB applications
Agreed to reply with the latest requirement document and working draft containing proposed
new BIFS profile.
m16005 This liaison supports creation of new BIFS profile and propose continuous
information exchange. Agreed to reply with the latest requirement document and working
draft containing proposed new BIFS profile.
m16217 This liaison supports creation of new BIFS profile and the list of requirements
identified. Agreed to reply with the latest requirement document and working draft containing
proposed new BIFS profile
Technical Work In Progress.
6
6.1
MPEG-4 ISO Base File Format (14496-12)
14496-12/COR 2 Usage of brands and box order in sample entry
6.1.1
Topics
1.
6.1.2
Sessio
n
Corrigendum Items
Contributions
Numbe
r
Title
Authors
69
FF
m1600
1
Summary of Voting on ISO/IEC 1449612:200X/DCOR 2 & ISO/IEC 1544412:200X/DCOR 2
SC 29 Secretariat
m16001 The comment is indeed spot on; we clarified this in an amendment to ISO2.
However, it did change reader behavior. We feel that some review and comment on this
correction should be permitted, so we will issue a new Cor using the proposed text, and
inviting comment
Technical Work In Progress.
6.2
14496-12:200X/AMD 1 General improvements
6.2.1
Topics
1.
6.2.2
Sessio
n
FF
FF
FF
FF
FF
Amendment
Contributions
Numbe
r
m1618
1
m1618
2
m1611
5
m1608
9
m1622
5
Title
Authors
Errata report for 14496-12:2005 (ISO Base
Media File Format)
On Movie Fragments, Edit Lists, and other
timing questions, for 14496-12 (ISO Base
Media File Format)
On FPDAM1 of ISO Base Media File
Format
Comments on 14496-12:200X FPDAM1
David Singer
Miscellaneous comments on ISO/IEC
14496-12:2008 FPDAM1
Miska M. Hannuksela
Stefan Döhla
David Singer
Miska M. Hannuksela
Teruhiko Suzuki
m16181 Editors to integrate into DCOR, or work with the secretariat, as appropriate.
m16182 Curious. Do we prefer replacement edit lists, which allow the writer to simplify life
for the reader (no calculations to see if edits should be merged, the writer did it), but also raise
the possibility of ‘re-defining the past’, or elf’s, which have the opposite characteristics?
There are questions of complexity here, of course. To think about, more comment welcome.
m16115 Oh dear. We have never said whether track references are always ‘strong’ (though
most, if not all, are today). We might want to clarify the status of references while we are at it.
Is a referencing track still useful if the target is removed? Do we need a better mechanism to
indicate track semantic grouping? Perhaps a track-grouping box, adjacent to track-reference,
that has a grouping-type and a group-id, and/or a track pointer? Perhaps also a strong/weak
grouping indicator? We’ll put something in the study and encourage NB review.
The startup sample group seems cool on first look. It could do with an example worked
through, and we like the idea of using the grouping_parameter to indicate multiple different
startup rolls. Does this go in the base spec. or the AVC/SVC/MVC spec.? Roll sample
groups are part 12 – but they work for audio, these don’t. Into the study.
70
m16089 Yes, we should restrict ctts version 1 to iso4-branded files, in the annex and probably
in the ctts section. The tutorial section should be updated “if you want composition to start at
time 0, you can…”. Thank you.
m16225 “There shall be at most one of each of …”. “If it is expected that the RTP … will be
used…”.
OK, editors to integrate in the study.
Technical Work in Progress.
7
7.1
MPEG-4 File Format (14496-14)
General
7.1.1
Topics
1.
7.1.2
Sessio
n
FF
New proposal
Contributions
Numbe
Title
r
m1605 Scalable Audio and MP4
4
Authors
Stefan Doehla
m16054 We need an amendment to Part14 to introduce the codec (sample entry) type ‘m4ae’
(MPEG-4 Audio Extension), and document its relationship with ‘dpnd’ track dependencies.
Mr Döhla to supply the PDAM text and the request for amendment. Supporting NBs:
Germany, France, Sweden, Singapore, USA.
7.1.3
Others
Thoughts on MPEG Surround signaling (m16117) This was handled in Audio, joint
session. We agreed on the direction and Audio will deal with it in the audio specs. No action
here.
8
MPEG-4 AVC Base File Format (14496-15)
71
8.1
14496-15:2004/COR.3
8.1.1
Topics
2.
Corrigendum
8.1.2
Sessio
n
FF
Contributions
Numbe
Title
r
m1596 Summary of Voting on ISO/IEC 144967
15:2004/DCOR 3 [SC 29 N 9880]
Authors
SC 29 Secretariat
m15967 All approved, thank you.
8.2
14496-15:2004/AMD 3
8.2.1
Topics
1.
8.2.2
Sessio
n
FF
FF
FF
FF
MVC File Format
Contributions
Numbe
r
m1599
3
m1610
2
m1611
4
m1618
0
Title
Authors
Summary of Voting on ISO/IEC 1449615:2004/PDAM 3
Updates to the MVC File Format
On MVC File Format
On the MVC File format (14496-15
amendment)
SC 29 Secretariat
Zhuangfei Wu
Per Fröjdh
Miska M. Hannuksela
Ying Chen
David Singer
m15993 Thank you for the comments. We are concerned, but do not know how to resolve,
about the question of track_ids. Some comments are not completely resolved to our
satisfaction; we may need to revisit some of these issues.
m16102 Thank you. We wonder about the view priority: only sample entry (now), both
entry and group box (as resolved), only the multiview group box (possible), a separate sample
group (as proposed) – or solve the whole problem with timed meta-data (like SVC)? We
allow it near view identifier for now.
The simplified view information is nice, but we had a lot of discussion on whether “adjacent”
is always well defined, and so on. (We prefer 0=undefined, 1=left, 2=right, 3=ordered
linearly but left/right not well defined.) We wonder if we need both local and global simple
view ordering information; we keep the global but delete the local. We wonder if we are
clear that lines are straight; what about (inflexion-point-free curved) arcs? We will include
that case also.
m16114 On the editorial and alignment material, accepted, thank you.
72
On the priority question, this is a-temporal, which worries us, but related to groups rather than
views, which is attractive. The question above on the whole treatment of priority also applies.
For now, we don’t put in the paragraph on ordering multiview groups.
We remove view association.
On the view relations, we might need a ‘differs in geometry differentiating code point’
(geom), e.g. that group is planar and this one is spherical. We put these code-points in, but
note that it provides some overlap with the information in the global view information (which,
for now, we retain).
We agree to delete the local supplementary view information completely (F.6.3.1.5 to 9).
The idea of ordering of views is interesting, but the MVCG doesn’t strictly list views, but tiers.
To consider further. Thank you for the disparity improvements.
m16180 Thank you. We will attempt to write a clear introduction, and a failure would
indicate that we need to simplify more!
9
LASeR (14496-20)
9.1
14496-20:200X/AMD 2 Adaptation
9.1.1
Topics
1.
9.1.2
Sessio
n
Scene
Scene
Scene
Adaptation
Contributions
Numbe
Title
r
m1596 Summary of Voting on ISO/IEC 144968
20:200X 2nd Edition/PDAM 2 [SC29 N
9881]
m1621 Late KNB comment on 14496-20 PDAM2
3
m1607 Study text on 14496-20 PDAM2
3
Scene
m1607
4
Improvement of parsingSwitch on 14496-20
PDAM2
Scene
m1607
5
Service Scenario examples on 14496-20
PDAM2
Scene
m1608
5
m1623
0
Comments on LASeR PDAM2
Scene
FNB Comments on LASeR PDAM2
m15968 Approved without comments
73
Authors
SC 29 Secretariat
Korea National Body
Seo-Young Hwang
Jaeyeon Song
Young-Kwon Lim
Seo-Young Hwang
Jaeyeon Song
Young-Kwon Lim
Seo-Young Hwang
Jaeyeon Song
Young-Kwon Lim
jean Le Feuvre
Cyril Concolato
jean Le Feuvre
Cyril Concolato
m16213 Late KNB comments asking to accept the changes made in Study text from Busan
meeting and proposed technical modifications in m16073 and 16074
m16073 This contribution proposed solutions to several open questions
 Proposed to add new attribute “minResolution” in DPI for screen size adaptation 
accepted with a note that square pixel is assumed
 Proposed to add new attribute “minSize” for text adaptation  accepted with
modification that option will be the separate attribute
 Proposed to add parameter for MemoryStatus event, the number of points, the number
of Unicode characters, and the size of composition buffer  accepted
m16074 This contribution proposed additional attributes for parsingSwitch
 Proposed to add new attribute “mode” & “removable”  accepted with modification,
adding one more mode, “ascending” and removing “removable”
m16075 This contribution illustrate the example use case of scene adaptation under the IPTV
service environment  agreed to add an AHG mandate to create demonstration content based
on this service scenario
m16085 This contribution proposes
 Simple media object filtering  accepted to put into TuC (need more investigation on
the relevance of categorization and completeness of lists)
 User input filtering  accepted to put into TuC (need more investigation on
completeness of lists)
 Advanced content constraints  accepted to put into FPDAM
m16230 This is official FNB comment version of m16085
Technical Work in Progress.
9.2
14496-20:200X/AMD 3 PSI
9.2.1
Topics
1.
9.2.2
Sessio
n
Scene
Presentation of Structured Information
Contributions
Numbe
Title
r
m1600 Summary of Voting on ISO/IEC 144960
20:200X/PDAM 3
Authors
SC 29 Secretariat
m16000 Approved with one comment from KNB. Unfortunately no experts to answer this
comment attend this meeting. So, disposition is delayed until the next meeting.
74
10
Open Font Format (14496-22)
10.1
14496-22:200X 2nd edition
10.1.1
Topics
1.
10.1.2
Sessio
n
Genera
l
Open Font Format
Contributions
Numbe
Title
r
m1598 Summary of Voting on ISO/IEC FCD
4
14496-22 [2nd Edition]
Authors
SC 29 Secretariat
m15984 Comments from JNB and USNB are all accepted. Need editing period until the end
of March.
Technical Work in Progress.
10.2
14496-22:200X Amd.X
10.2.1
Topics
1.
Open Font Format Extension
10.2.2
Contributions
Sessio Numbe
Title
Authors
n
r
Genera m1602 Proposal for a new work item for ISO/IEC
Simon Daniels
l
3
14496-22
Vladimir Levantovsky
m16023 This contribution propose to establish an AHG with the mandate to explore possible
ways to overcome the 64K limit of the existing font format specification, which would allow
supporting full Unicode character repertoire without an adverse effect on existing
implementations.  Agreed to establish AHG with proposed mandate.
Technical Work in Progress.
11
11.1
MPEG-7
15938-12 MPEG Query Format
11.1.1
Topics
1.
Corrigendum
75
11.1.2
Sessio
n
Genera
l
Genera
l
Contributions
Numbe
r
m1597
1
m1599
4
Title
Authors
Table of Replies on ISO/IEC FDIS 1593812 [SC 29 N 9911]
Summary of Voting on ISO/IEC 1593812:2008/DCOR 1
ITTF via SC 29
Secretariat
SC 29 Secretariat
m15971 ISO/IEC FDIS 15938-12 is approved.
m15994 ISO/IEC 15938-12:2008/DCOR1 is approved without any comments.
Technical Work in Progress.
12
12.1
21000 MPEG-21
21000-2 DID
12.1.1
Topics
1.
12.1.2
Sessio
n
Scene
PSI
Contributions
Numbe
Title
r
m1615 On WD 1.0 of ISO/IEC 21000-2:2005
5
AMD1 (PSI)
Authors
Christian Timmerer
m16155 Raising open questions regarding the place to embed presentation element and
proposing to include example. Accept to be included in new Working Draft.
Technical Work In Progress.
12.2
21000-19 Media Value Chain Ontology
12.2.1
Topics
1.
Media Value Chain Ontology
12.2.2
Contributions
Sessio Numbe
Title
n
r
MVCO m1599 Summary of Voting on ISO/IEC CD 210005
19
MVCO m1602 Liaison Statement from IFPI [SC 29 N
0
9995]
MVCO m1623 Liaison Statement from EDItEUR
1
76
Authors
SC 29 Secretariat
IFPI via SC 29 Secretariat
MVCO
m1619
3
Liaison Statement from the International
DOI Foundation (IDF)
IDF via SC 29 Secretariat
m15995 One approval with comment from Spain, Three disapproval with comments from
Japan, Germany and USA. Unfortunately comments are not finally disposed during the
meeting because of lack of participants from certain Nation Body and lack of time to discuss
the issues. Final disposition is delayed until next meeting.
m16020, m16193, and m16231 Similar liaisons raising confusion about the purpose of
MVCO and its relationship with RDD. Agreed to send replies explaining that the MVCO is
intended as a fully machine readable ontology with a core model centered on the value chain
while the RDD was designed to be implemented as a rights dictionary for referencing terms
and their human definitions.
Technical Work In Progress.
13
MPEG-A MAF (23000)
13.1
23000-6 Professional Archival AF
13.1.1
Topics
1.
Professional Archival AF
13.1.2
Sessio
n
MAF
MAF
MAF
Contributions
Numbe
r
m1599
6
m1608
3
m1617
6
Title
Authors
Summary of Voting on ISO/IEC 230006:200X/PDAM 1
Liaison Statement from SC 29/WG 1
Status report on ISO/IEC 23000-6
Professional Archival Application Format
Reference Software and Conformance files
SC 29 Secretariat
WG 1 via SC 29
Secretariat
Houari Sabirin
Hendry
Noboru Harada
Munchurl Kim
m15996 PDAM registration is approved without any comment. PDAM text is approved with
one comment from JNB asking conf. sw. and bitstream as soon as possible. This is accepted.
m16083 JPEG is working on a Digital Cinema archival format. The packaging format will be
based on MPEG-21 File Format and willing to use PA AF as a base and extend it to fulfill
their requirements. We will inform the progress of PA AF conf. and ref. sw. and invite to join
the AHG reflector for further discussion.
m16176 The first version of PA AF packager and extractor are developed. Only one
conformance point can be checked with this software. API will be developed further. This
work will be completed by July 2009.
77
Technical Work in Progress.
13.2
23000-9 DMB Application Format
13.2.1
Topics
1.
13.2.2
Sessio
n
MAF
MAF
DMB Application Format
Contributions
Numbe
r
m1596
9
m1607
8
Title
Authors
Summary of Voting on ISO/IEC 230009:2008/PDAM 1 [SC 29 N 9882]
Updated text, conf. files, and ref. sw for
ISO/IEC 23000-9 (DMB-AF)
SC 29 Secretariat
Hui Yong Kim
Myung Seok Ki
HanKyu Lee
Houari Sabirin
Munchurl Kim
Jung Soo Lee
Yong Han Kim
m15969 PDAM has been approved without any comments
m16078 No major changes to the workplan. The softwares and bitstream will be ready by
July 2009
Technical Work in Progress.
13.3
23000-10 Video Surveillance MAF
13.3.1
Topics
1.
13.3.2
Sessio
n
MAF
Video Surveillance MAF
Contributions
Numbe
Title
r
m1597 Summary of Voting on ISO/IEC 230007
10:200X/PDAM 1 [SC 29 N 9929]
Authors
SC 29 Secretariat
m15977 Editorial comments from Japan, US, Germany are all aceepted. One major technical
comment from UK is also accepted but needs two weeks of editing period to implement.
Technical Work in Progress.
78
13.4
23000-11 Stereoscopic Video AF
13.4.1
Topics
1.
13.4.2
Sessio
n
MAF
MAF
Stereoscopic Video AF
Contributions
Numbe
Title
r
m1613 Proposed Text for WD of ISO/IEC 230003
11 Stereoscopic Video AF Reference
Software
m1622 Proposed Corrigendum on ISO/IEC 230002
11 Stereoscopic Video Application Format
Authors
Next generation
Broadcasting
Forum(Korea)
Next generation
Broadcasting
Forum(Korea)
m16133 The firs version of SSVAF player has been implemented. The final version will be
developed by July 2009.
m16222 It was identified during the editing of the final text of FDIS that the descriptor to
identify the type of voice codec used is missing. Editors agreed to issue a DCOR at this
meeting by adding Sample Entry Boxes as used in 3GP file format standard.
Technical Work in Progress.
13.5
23000-12 Interactive Music AF
13.5.1
Topics
1.
13.5.2
Sessio
n
MAF
Interactive Music AF
Contributions
Numbe
Title
r
m1608 Study text of ISO/IEC 23000-12 WD
1
Interactive music application format
MAF
m1613
1
Report of Mini Experiment on IM AF
Constraints representation
MAF
m1613
2
Constraints Specifications for IM AF
MAF
m1613
Constraints representation method for IM
79
Authors
Inseon Jang
Huiyong Kim
Jeongil Seo
Laurent Primaux
Owen Lagadec
Emmanuel Bouix
Fabien Gallot
Inseon Jang
Hui Yong Kim
Jeongil Seo
Kyeongok Kang
Laurent Primaux
Owen Lagadec
Emmanuel Bouix
Fabien Gallot
Laurent Primaux
4
MAF
AF
m1621
4
A proposal for streaming support for IM AF
Owen Lagadec
Emmanuel Bouix
Fabien Gallot
Inseon Jang
Hui Yong Kim
Jeongil Seo
Kyeongok Kang
Yongwei Zhu
Susanto Rahardja
Te Li
Haibin Huang
m16132 Two constrains, Selection Constraint and Mixing Constraint have been described.
This has been used as a base of mini experiment
m16131 Two method, ISO File Format and MPEG-21 DIA UCD were compared to verify
which represents constraints better. ISO File Format fulfills all requirements more efficiently
than MPEG-21 DIA. ISO File Format is recommended to be used.
m16134 This contribution proposes Constraints representation method based on ISO File
Format. Accepted to be included in the CD.
m16081 Improved text of previous WD.
m16214 Propose to include SLS codec to support scalability. However, scalability does not
seem to be an appropriate requirement for IMAF.
14
Project Started
14.1
MPEG eXtensible Middleware
14.1.1
Topics
1.
2.
14.1.2
Sessio
n
MxM
MxM
MxM
MxM
MPEG eXtensible Middleware Architecture and Technology
MXM APIs
Contributions
Numbe
r
m1618
3
m1618
4
m1603
9
m1612
6
Title
Authors
Proposed WD3.0 of MxM Architecture and
Technologies
Proposed WD3.0 of MxM APIs
Filippo Chiariglione
Proposal for new APIs of video metadata on
MXM APIs
DMAG-UPC Comments on WD2.0 of
MXM API
Wonsuk Lee
Seungyun Lee
Jaime Delgado
Eva Rodríguez
Víctor Rodríguez-Doncel
Silvia Llorente
80
Filippo Chiariglione
MxM
MxM
MxM
MxM
MxM
MxM
MxM
MxM
m1618
5
m1618
6
m1603
7
m1615
8
m1618
7
m1615
1
Proposed WD2.0 of MxM Ref. SW. and
Conf.
Proposed WD2.0 of 2nd edition of ISO/IEC
29116-1 (MXM Protocols)
Proposal of MXM Ontology for inter-MXM
communication protocols
Updates for the MPEG Extensible
Middleware
MXM use-case proposals for 3D services
m1600
3
m1612
8
Liaison Statement from W3C [SC 29 N
9930]
Presentation of the W3C MAWG Activities
MXM API for 3D Graphics content creation
Rubén Barrio
Víctor Torres
Filippo Chiariglione
Filippo Chiariglione
Kangchan Lee
Seungyun Lee
Christian Timmerer
Patrick Gioia
Ivica Arsov
marius.preda@int-evry.fr
Françoise Preteux
W3C via SC 29
Secretariat
Victor Rodriguez-Doncel
Jaime Delgado
Ruben Tous
m16183 - Not many big changes compared to the version approved in Busan
m16184 - Recommendation: Use Doxygen or similar tools to generate API specification
automatically.
m16037 - We expect further contributions in terms of use cases and requirements that MXM
currently does not support in order to understand whether an MXM ontology is needed, and if
so which would be the best place to accommodate it.
m16039 - Having a generic API to set/get typical metadata fields not only MPEG-7 based
would be desirable. Recommendation: add generic API next to more MPEG-7 oriented API to
support different types of metadata within MXM
m16126 - All proposed additions were accepted; we will leave the specific work on the API
to the editing period
m16128 - A liaison statement was written.
m16151 - Very good progress on 3D Graphics part of MediaFramework engine, now having
both creation and access API. We need a uniform approach for dealing with file format issues
in MXM
m16158 - DIA APIs are in a very good shape. Reference software available in Java
m16187 - Interesting use cases to be considered within MXM Protocols
m16220 - Recommendation: add the caliph and the MPEG7 JRS to the set of MXM APIS;
consider
harmonisation with MXM style - harmonise the documentation and the code
style
Technical Work in Progress.
14.2
Representation of Sensory Effects
14.2.1
Topics
1.
Representation of Sensory Effects
81
14.2.2
Sessio
n
RoSE
Contributions
Numbe
r
m1602
4
m1615
7
m1616
7
m1616
9
MPEG Representation of Sensory Effects
Vision
Minor Corrections to RoSE WD 2.0 XML
Schema
Updates and Additional Tools for MPEG
RoSE
A simple RoSE system implementation
including SDC, USP, and SDCom
RoSE
m1617
0
A demonstration for reference color type
and its parameters in RoSE
RoSE
m1619
1
m1619
2
Study on Sensory Effect Metadata
RoSE
m1620
1
Comments and Proposal for Sensory Effect
Metadata
RoSE
m1620
3
A proposal for RoSE system architecture
RoSE
RoSE
RoSE
RoSE
Title
Authors
Proposal for Sensory Effect Metadata
Christian Timmerer
Markus Waltl
Christian Timmerer
Markus Waltl
Christian Timmerer
Jin-Seo Kim
Maeng-Sub Cho
Bon-Ki Koo
Yong Soo Joo
Sang-Kyun Kim
Jin-Seo Kim
Maeng-Sub Cho
Bon-Ki Koo
Yong Soo Joo
Sang-Kyun Kim
Yasuaki Tokumo
Shin-ya Hasegawa
Yasuaki Tokumo
Shin-ya Hasegawa
Takuya Iwanami
B. S. Choi
SangHyun Joo
HaeRyong Lee
KwangRo Park
Sanghyun Joo
m16157
Summary:
- Minor corrections for XML schema in WD2.0
Evaluations:
- Agreed.
m16167
Summary:
- Classification schemes for human readability, extensibility
- Sensory effect pattern
- New sensory effect types for water sprayer, perfumer, fog, blind, and sound
Evaluations:
- CS: ok, adopted
- Sensory effect pattern: not adopted
- Water sprayer: ok, adopted
- Perfumer: ok, adopted
- Fog: ok, adopted
82
- Window blind: basically adopted but needs to be aligned m16201 (shading); add’l
attributes might be ‘range’, ‘speed’, …
- Sound: not sure because this could be also integrated into the audio channel(s) of the
movie. Needs further evidence before being included as a new sensory effect type.
m16169
Summary:
- Demonstration of implementation of RoSE system applied the “wind” sensory effect
Evaluations:
- Informative report for demonstration purposes only.
- The BoG thanks the authors for this invaluable contribution and solicits further
contributions like that.
m16170
Summary:
- New sensory effect type for color correction effect
- ReferenceColorParameterType as extension of the ParameterBaseType
- ReferenceColor Effect: basically turn on/off
Evaluations:
- Reference color effect can be applied to certain scenes only (e.g., advertisement,
coloured scenes in a black/white movie)
- Size of the parameters is constant and independent of the number of scenes where
the correction should be applied
- Name should be changed to “color correction”
m16191
Summary:
- Study on TuC regarding representation of position
- Clarification of WD 2.0 regarding SEM Adaptability
- Abstract Position Table (APT) + Specific Location Table (SLT)
Evaluations:
- Harmonize with m16201, e.g., adopt a classification scheme with the positions in a
hierarchical way that may be grouped according to a certain criteria (see also below).
- Proposed changes for ‘adaptType’ as informative note
m16192
Summary:
- Hint information for fragmentation
- Improvement for time model
- Hint information for automatic extraction
Evaluations:
- Hint information for seamless play: interesting, but this is considered something to
be handled by the RoSE engine and does not require explicit signalling in metadata
- DTS: interesting, but warm-up/initialization time is an issue for the device
capabilities but not for the sensory effect metadata because ‘dts’ cannot be known at
authoring stage
- Fade-in/-out-value: interesting but possible with existing tools, i.e., (1) start up to
intensity Ii and (2) fade-in from Ii to I. We should consider including this as an
example.
- Hint information for automatic extraction: adopt ‘autoExtraction’ but signal this at
the beginning in the ‘header’ of the sensory effect metadata
83
m16201
Summary:
- Comments on TuC
- New sensory effect types for Flash, Color Light, heating/cooling, shading
Evaluations:
- position: we that but we’ll adopt a classification scheme approach and develop a
wild-card mechanism for, e.g., addressing all effects on the left-hand side
- direction: withdrawn
- namedColor & colorInRGB: adopt both, i.e., “Standard Named Colours” and RGB
with hexadecimal representation
- intensity in general: keep as it is but may need to define a range for Celsius,
Beaufort, Richter.
- intensity for heating/cooling: seems to be a general issue as it is not clear how
temperature is perceived by individual persons – there’s a need for (room)
temperature classification like it is done for http://en.wikipedia.org/wiki/Lux
- FlashType: ok, adopted but frequency need not be restricted
- ColorLightType: merged to LightType
- Shading vs. Window Blind: range [closed, opened], for speed the scale is not clear
and we’ll adopt only ‘slow’ and ‘fast’ for the moment
m16204
Summary:
- Concept, Scope of standard
Evaluations:
- Agreed through e-mail reflector.
m16203
Summary:
- Detail information for implementation
Evaluations:
- Need more discussion in BoG meeting
Technical Work in Progress.
14.3
MPEG-V
14.3.1
Topics
1.
14.3.2
Information exchange with Virtual Worlds
General Issues
 Harmonization with RoSE
o Agreed on the structure of harmonized standard. There will be no
differentiation of schemas for “information exchange between virtual
worlds” and “information exchange between virtual worlds and real
worlds”
o Proposed structure
 Part 1 Architecture
 Part 2 Control Information
 Part 3 Sensory Information
84
 Part 4 Avatar Information (TBD)
o Schemas to represent virtual world information will be categorized based
on the characteristics of what is described and will form a separate parts
14.3.3
Session
MPEGV
MPEGV
MPEGV
MPEGV
Contributions
Numbe
r
m1607
2
m1608
0
m1617
4
m1605
1
Title
Authors
MPEG-V CfP Response
Jean H.A. Gelissen (ed)
MPEG-V CfP Response
Jean H.A. Gelissen (ed)
MPEG-V CfP Response
Jean H. A. Gelissen (Ed)
Full motion control and navigation of
avatar/object with multi-input sources in
MPEG-V
jeong-hwan ahn
The first conclusions from the evaluation of these contributions are that they address the
following requirement areas of MPEG-V as indicated in the contributions:
m16051: Requirements related to ‘Data representations between virtual worlds and the real
world’
m16072: Requirements related to ‘Data representations between virtual worlds’
m16080: Requirements related to ‘Data representations between virtual worlds and the real
world’
m16174: Requirements related to ‘Data representations between virtual worlds and the real
world’
As mentioned in the contributions m16072, m16080, and m16051are in an early stage, in the
case of m16174 even at the stage of a more detailed description of the requirements, due to
the early stage of the related activities. They all also mention however that more detailed
contributions will be submitted for next MPEG meetings.
85
15
Exploration
15.1
Richmedia UI Framework
15.1.1
Topics
1.
Richmedia UI Frameworks
15.1.2
Sessio
n
Scene
Contributions
Numbe
Title
r
m1603 Items under considerations in Rich UI
8
Framework
Authors
Kyungmo Park
Cyril Concolato
Jean Le Feuvre
Giovanni Cordara
m16038 Prposed update to Context and Objectives of RMUIF.
 the goal of this activity is to integrate the MPEG RMUIF with the existing MPEG
delivery systems: the ISO base media file format, the MPEG-2 TS and MPEG-4
Part 8 (4onIP).
 The MPEG framework should be divided in two parts: the first part agnostic of the
presentation and focused on the delivery and the second part agnostic of the
delivery and focused on the presentation.
15.2
Advanced IPTV Terminal
15.2.1
Topics
1.
15.2.2
Sessio
n
AIT
AIT
AIT
AIT
AIT
AIT
AIT
Advanced IPTV Terminal
Contributions
Numbe
r
m1600
8
m1600
9
m1601
0
m1601
1
m1601
2
m1601
3
m1601
4
Title
Authors
Use cases for consideration by Ad Hoc
Group on Advanced IPTV Terminal
Technologies for consideration by Ad Hoc
Group on Advanced IPTV Terminal
Peer-to-Peer iDRM
Leonardo Chiariglione
Web, Internet and Mobile TV
Filippo Chiariglione
,Tiejun Huang
Leonardo Chiariglione
The Digital Media in Italia proposal
Open IPTV Platform For an Open Content
Market
Approaching the Zettabyte Era
86
Leonardo Chiariglione
Walter Allasia
Lucia Marchisio
Young-Kwon LIM
AIT
AIT
AIT
m1601
5
m1616
8
Contribution to the scope of the planned
Advanced IPTV Terminal standard
Use cases and Requirements for Advanced
Internet TV Terminals
m1618
8
Proposal of Advanced IPTV Terminal (AIT)
requirements
Young-Kwon LIM
Christian Timmerer
Mark Stuard
Franc Kozamernik
Jari Ahola
Leonardo Chiariglione
m16008, m16009, m16010, m16011, m16012, m16013 and m16014 were reviewed at
Torino AHG meeting and integrated to produce m16015 proposing more detailed scope of
AIT project
m16168 This contribution provides commercial use cases and requirements derived from
them especially in a Peer-to-Peer environment. Agreed to adopt some use cases as starting
points to develop AIT use cases.
m16188 This contribution proposed refined and restructured requirements on AIT.
15.3
AOB
15.3.1
Joint meeting with Video & Requirement
Question about the modern way of transportation of multimedia content is raised. Five
possible areas might be useful to explorer are identified. This seems to be HVC System work
item…
 Transport- and file format friendly stream format (David Singer, Apple)
 Cross layer optimization between video and transport layer (Young-Kwon)
 Error resilience for MPEG streams (Joern)
 4th bullet (Friendliness with other transport mechanism (Leonardo)
 Content adaptation to different networks (Christian)
16
Liaison
Cf. Liaison output.
87
17
Latest References and Publication Status
Reference on the ISO Web Site : http://www.itscj.ipsj.or.jp/sc29/open/29view/29n9270c.htm
Pr
Pt
2
1
1
1
1
1
ISO/IEC 13818-1/Amd.7
1
1
ISO/IEC 13818-1:2000/Amd.2 (Support for IPMP on 2)
1
1
ISO/IEC 13818-1:2000/Amd.4 (Metadata Application CP)
1
1
1
ISO/IEC 13818-1:2000/COR3 (Correction for Field Picture)
ISO/IEC 13818-1:2006 (MPEG-2 Systems 3rd Edition)
2
1
1
ISO/IEC 13818-1:2006/Amd.1 (Transport of Streaming text)
N8369
2
1
ISO/IEC 13818-1:2006/Amd.2 (Carriage of Auxialiry Video Data)
N8798
2
2
2
2
2
2
2
2
2
2
2
2
Standard
No.
Issue
ISO/IEC 13818-1:2000 (MPEG-2 Systems 2nd Edition)
ISO/IEC 13818-1:2000/COR1 (FlexMux Descr.)
ISO/IEC 13818-1:2000/COR2 (FlexMuxTiming_ descriptor)
ISO/IEC 13818-1:2000/Amd.1 (Metadata on 2) & COR1 on Amd.1
ISO/IEC 13818-1:2000/Amd.3 (AVC Carriage on MPEG-2)
ISO/IEC 13818-1:2000/Amd.5 (New Audio P&L Sig.)
ISO/IEC 13818-1:2000/COR4 (M4MUX Code Point)
ISO/IEC 13818-1:2000/COR5 (Corrections related to 3rd Ed.)
00/12
N3844
N4404
N5867
N5604
N5771
N6847
N6585
N6845
N7469
N7895
01/01 Pisa
01/12 Pattaya
03/07
Trondheim
03/03 Pattaya
03/07
Trondheim
04/10 Palma
04/07
Redmond
04/10 Palma
05/07 Poznan
06/01
Bangkok
06/xx
88
06/07
Klagenfurt
07/01
Marrakech
Status
Doc. With
Purpose
Published
Published
Published
Published
Published
2000/12
2000/12
2002/03
2002/12
2003/12
ISO
Award
Done
Proposed
N/A
N/A
Proposed
Published
Published
2004/03
XXXX
N/A
Proposed
FDAM
FDAM
ITTF
ITTF
to be published
to be published
N/A
N/A
COR
COR
COR
ITTF
ITTF
ITTF
to be published
to be published
to be published
N/A
N/A
N/A
Published
FDAM
ITTF
ITTF
to be published
TBP
TBP
FDAM
ITTF
to be published
TBP
2
1
ISO/IEC 13818-1:2006/Cor.1.2 (Reference to AVC Specification)
N9365
2
1
ISO/IEC 13818-1:2006/Amd.3 (SVC in MPEG-2 Systems)
2
1
ISO/IEC 13818-1:2006/Cor.2 (Corrections to SVC in MPEG-2)
2
11
1
N1005
8
N1024
0
N5607
N2501
4
4
ISO/IEC 13818-1:2003 (IPMP on 2)
st
ISO/IEC 14496-1 (MPEG-4 Systems 1 Ed.)
07/10
Shenzhen
08/07
Hannover
08/10 Busan
FDAM
ITTF
to be published
TBP
FDAM
ITTF
to be published
TBP
COR
ITTF
to be published
TBP
03/03 Pattaya
98/10 Atl. City
Published
Published
2003/12
1999/12
Proposed
Done
Published
Published
2001/11
2001/11
Done
N/A
Published
Published
COR
COR
2001/11
2002/10
ITTF
ITTF
N/A
Done
N/A
N/A
COR
ITTF
N/A
AMD
ITTF
N/A
Published
2004-05
N/A
Published
Published
2003/12
2004-08
N/A
N/A
AMD
PDAM
ITTF
ITTF
PDAM
IS
ITTF
ITTF
1
1
ISO/IEC 14496-1/Amd.1 (MP4, MPEG-J)
ISO/IEC 14496-1/Cor.1
N3054
N3278
ISO/IEC 14496-1:2001 (MPEG-4 Systems 2nd Ed.)
N3850
4
1
1
1
1
99/12 Hawaii
00/03
Noordwijk.
01/01 Pisa
ISO/IEC 14496-1:2001/Cor.2
N4264
N5275
01/07 Sydney
02/10 Shangai
4
1
ISO/IEC 14496-1:2001/Cor.3
N6587
4
1
ISO/IEC 14496-1:2001/Amd.2 (Textual Format)
N4698
4
1
ISO/IEC 14496-1:2001/Amd.3 (IPMP Extensions)
N5282
4
1
1
ISO/IEC 14496-1:2001/Amd.4 (SL Extension)
N5471
N5976
1
1
ISO/IEC 14496-1:2001/Amd.8 (ObjectType Code Points)
N6202
N7229
04/07
Redmond
02/03 Jeju
Island
02/10
Shanghai
02/12 Awaji
03/10
Brisbanne
03/12 Hawaii
05/04 Busan
1
1
ISO/IEC 14496-1:200x/Cor4 (Node Coding Table)
N7473
N5277
05/07 Poznan
02/10
4
4
4
4
4
4
4
4
4
ISO/IEC 14496-1:2001/Amd.1 (Flextime)
ISO/IEC 14496-1:2001/Cor.1
ISO/IEC 14496-1:2001/Amd.7 (AVC on 4)
ISO/IEC 14496-1:200x/Amd.1 (Text Profile Descriptors)
ISO/IEC 14496-1 (MPEG-4 Systems
3rd
Ed.)
89
to be published
Final Text
Editing
to be published
to be published
N/A
N/A
N/A
Proposed
4
1
ISO/IEC 14496-1:200x/Amd.1 (Text Profile Descriptors)
N7229
4
1
ISO/IEC 14496-1:200x/Cor.1 (Clarif. On audio codec behavior)
N8117
4
1
ISO/IEC 14496-1:200x/Amd.2 (3D Profile Descriptor Extensions)
N8372
4
1
ISO/IEC 14496-1:200x/Cor.2 (OD Dependencies)
N8646
4
1
ISO/IEC 14496-1:200x/Amd.3 (JPEG 2000 support in Systems)
N8860
4
4
ISO/IEC 14496-1:200x/Amd.17 (ATG Conformance)
N8861
4
4
ISO/IEC 14496-1:200x/Amd.22 (AudioBIFS v3 conformance)
N9295
4
4
ISO/IEC 14496-1:200x/Amd.23 (Synthesized Texture conformance)
N9369
4
4
ISO/IEC 14496-1:200x/Amd.24 (File Format Conformance)
N9370
4
4
ISO/IEC 14496-1:200x/Amd.25 (LASeR V1 Conformance)
N9372
4
4
ISO/IEC 14496-1:200x/Amd.26 (Open Font Format Conf.)
N9815
4
4
ISO/IEC 14496-1:200x/Amd.27 (LASeR Amd.1 Conformance)
N9816
4
5
ISO/IEC 14496-1:200x/Amd.12 (File Format)
N9020
4
5
ISO/IEC 14496-1:200x/Amd.14 (Open Font Format)
4
5
5
6
ISO/IEC 14496-1:200x/Amd.16 (SMR Ref. Soft)
N1024
6
N9672
N9674
4
4
ISO/IEC 14496-1:200x/Amd.17 (LASeR Ref. Soft)
ISO/IEC 14496-6:2000
90
Shanghai
05/04 Busan
06/04
Montreux
06/07
Klagenfurt
06/10
Hangzhou
07/01
Marrakech
07/01
Marrakech
07/07
Lausanne
07/10
Shenzhen
07/10
Shenzhen
07/10
Shenzhen
08/04
Archamps
08/04
Archamps
07/04 San Jose
08/10 Busan
08/01 Antalya
08/01 Antalya
N/A
ITTF
Final Text
Editing
Final Text
Editing
to be published
COR
ITTF
to be published
N/A
PDAM
ITTF
to be published
N/A
PDAM
ITTF
to be published
N/A
PDAM
ITTF
to be published
N/A
PDAM
ITTF
to be published
N/A
PDAM
ITTF
to be published
N/A
PDAM
ITTF
to be published
N/A
PDAM
ITTF
to be published
N/A
PDAM
ITTF
to be published
N/A
PDAM
ITTF
to be published
N/A
PDAM
ITTF
to be published
N/A
PDAM
PDAM
Published
ITTF
ITTF
2000/12
to be published
to be published
N/A
N/A
N/A
PDAM
ITTF
COR
ITTF
PDAM
N/A
N/A
4
ISO/IEC 14496-8 (MPEG-4 on IP Framework)
4
8
11
4
11
ISO/IEC 14496-11/Amd.1 (AFX)
4
11
4
11
Published
FDIS
2004-05
SC29
N5480
02/03 Jeju
05/01
HongKong
02/12 Awaji
FDAM
ITTF
ISO/IEC 14496-11/Amd.2 (Advanced Text and Graphics)
N6205
03/12 Hawaii
FDAM
ITTF
ISO/IEC 14496-11/Cor.1
N6203
03/12 Hawaii
COR
SC29
ISO/IEC 14496-11 (MPEG-4 Scene Description 3rd
Edition)
N4712
N6960
4
11
ISO/IEC 14496-11/Cor.3 Valuator/AFX related correction N6594
4
11
ISO/IEC 14496-11/Amd.3 Audio BIFS Extensions
N6591
4
11
ISO/IEC 14496-11/Amd.4 XMT and MPEG-J Extensions
N6959
4
11
ISO/IEC 14496-11/Cor.3 (Audio BIFS Integrated in 3rd Edition)
N7230
4
11
ISO/IEC 14496-11/Cor.5 (Misc Corrigendum)
N8383
4
11
ISO/IEC 14496-11/Amd.5 Symbolic Music
Representation
N8657
4
ISO/IEC 14496-11/Cor.6 (AudioFx Correction)
4
11
12
ISO/IEC 14496-12 (ISO Base Media File Format)
N9021
N5295
4
12
ISO/IEC 14496-12/Amd.1 ISO FF Extension
N6596
4
12
N7232
4
12
ISO/IEC 14496-12/Cor.1 (Correction on File Type
Box)
ISO/IEC 14496-12/Cor.2 (Miscellanea)
4
12
ISO/IEC 14496-12/Amd.1 (Description of timed
metadata)
N8659
N7901
91
04/07
Redmond
04/07
Redmond
05/01
HongKong
05/04 Busan
06/07
Klagenfurt
06/10
Hangzhou
07/04 San Jose
02/10
Shanghai
04/07
Redmond
05/04 Busan
06/01
Bangkok
06/10
Hangzhou
Final Text
Editing
Integration in 1st
Ed.
Integration in 1st
Ed.
Proposed
Proposed
N/A
N/A
N/A
st
Integration in 1
Ed.
Integration in 1st
Ed.
Integration in 1st
Ed.
Final Text
Editing
N/A
COR
ITTF
FDAM
ITTF
FDAM
ITTF
COR
ITTF
COR
SC29
N/A
FDAM
ITTF
TBP
COR
Published
SC29
2004-02
N/A
Proposed
FDAM
ITTF
FDAM 04/11/30
N/A
COR
ITTF
N/A
COR
ITTF
Final Text
Editing
Final Text
Editing
FDAM
ITTF
Proposed
N/A
N/A
N/A
N/A
4
12
ISO/IEC 14496-12/Cor.3 (Miscellanea)
N9024
07/04 San Jose
COR
ITTF
4
07/04 San Jose
FDAM
ITTF
N/A
4
12
COR
ITTF
N/A
13
N1025
0
N5284
08/10 Busan
4
ISO/IEC 14496-12/Amd.2 (Flute Hint Track)
ISO/IEC 14496-12:2008 (ISO Base Media File Format
2nd ed.)
ISO/IEC 14496-12:2008/Cor.1 (Corrections to Flute
Hint Track)
ISO/IEC 14496-13 (IPMP-X)
N9023
4
12
12
IS
ITTF
4
14
ISO/IEC 14496-14 (MP4 File Format)
N5298
Published
2003-11
4
14
ISO/IEC 14496-14/Cor.1 (Audio P&L Indication)
N7903
COR
ITTF
4
15
ISO/IEC 14496-15 (AVC File Format)
N5780
Published
2004-04
4
15
ISO/IEC 14496-15/Amd.1 (Support for FREXT)
N7585
02/10
Shanghai
02/10
Shanghai
06/01
Bangkok
03/07
Trondheim
05/10 Nice
FDAM
ITTF
4
4
15
15
ISO/IEC 14496-15/Cor.1
ISO/IEC 14496-15/Cor.2 (NAL Unit Restriction)
N7575
N8387
COR
COR
ITTF
ITTF
N/A
N/A
4
15
N9682
FDAM
ITTF
N/A
4
17
18
18
ISO/IEC 14496-15/Amd.2 (SVC File Format
Extension)
ISO/IEC 14496-17 (Streaming Text)
ISO/IEC 14496-18 (Font Compression and Streaming)
ISO/IEC 14496-18/Cor.1 (Misc. corrigenda and
clarification)
ISO/IEC 14496-19 (Synthesized Texture Stream)
ISO/IEC 14496-20 (LASeR)
ISO/IEC 14496-20/Cor.1 (Misc. corrigenda and
clarification)
FDAM
Published
COR
ITTF
2004-07
ITTF
TBP
Proposed
N/A
Published
FDAM
COR
2004-07
Editor
ITTF
Proposed
TBP
N/A
4
4
4
4
4
19
20
20
N7479
N6215
N8664
N6217
N7588
N8666
92
05/10 Nice
06/07
Klagenfurt
08/01 Antalya
05/07 Poznan
03/12 Hawaii
06/10
Hangzhou
03/12 Hawaii
05/10 Nice
06/10
Hangzhou
Final Text
Editing
to be published
N/A
Proposed
Proposed
Final Text
Editing
N/A
Proposed
Final Text
Editing
N/A
4
4
20
20
ISO/IEC 14496-20/Amd.1 (LASeR Extension)
ISO/IEC 14496-20/Cor.2 (Profile Removal)
N9029
N9381
4
20
ISO/IEC 14496-20/Amd.2 (SVGT1.2 Support)
N9384
4
22
ISO/IEC 14496-22 (Open Font Format)
N8395
7
1
ISO/IEC 15938-1 (MPEG-7 Systems)
N4285
7
ISO/IEC 15938-1/Amd.1 (MPEG-7 Systems Extensions)
7
1
1
1
1
2
7
ISO/IEC 15938-7/Amd.2 (Fast Access Ext. Conformance)
N6326
N6328
N7490
N7532
N4288
N8672
7
12
ISO/IEC 15938-12 MPEG Query Format
N9830
21
7
21
7
ISO/IEC 21000-7 AMD 1 (DIA Query Format
Capability)
ISO/IEC 21000-7 COR 1 (Corrections)
21
8
ISO/IEC 21000-8 AMD 1 (Extra Ref. SW)
21
9
ISO/IEC 21000-9 (MPEG-21 File Format)
N1026
0
N1026
2
N6975
21
9
ISO/IEC 21000-9/Amd.1 (MPEG-21 Mime Type)
N9837
21
15
ISO/IEC 21000-15 (Security in Event Reporting)
N9839
7
7
7
7
ISO/IEC 15938-1/Cor.1 (MPEG-7 Systems Corrigendum)
ISO/IEC 15938-1/Cor.2 (MPEG-7 Systems Corrigendum)
ISO/IEC 15938-1/Amd.2 (BiM extension)
ISO/IEC 15938-2 (MPEG-7 DDL)
93
07/04 San Jose
07/10
Shenzhen
07/10
Shenzhen
06/07
Klagenfurt
01/07 Sydney
FDAM
FDAM
ITTF
ITTF
N/A
N/A
FDAM
ITTF
N/A
FDAM
Editor
Published
2002/07
FDAM
COR
COR
FDAM
Published
FDAM
ITTF
Editor
ITTF
ITTF
2002/02
ITTF
FDIS
ITTF
FDAM
ITTF
08/10 Busan
COR
ITTF
08/10 Busan
FDAM
ITTF
05/01
HongKong
08/04
Archamps
08/04
Archamps
FDIS
ITTF
FDAM
ITTF
Done
FDIS
ITTF
TBP
04/03 Munich
04/03 Munich
05/07 Poznan
05/10 Nice
01/07 Sydney
06/10
Hangzhou
08/04
Archamps
Final Text
Editing
TBP
Done
FDAM 04/11/28
N/A
N/A
N/A
N/A
Done
N/A
N/A
FDIS 05/01/21
Done
21
A
16
5
4
4
ISO/IEC 21000-16 (MPEG-21 Binary Format)
ISO/IEC 21000-5 (Open Release Content Profile)
ISO/IEC 23000-4 (Musical Slide Show MAF)
ISO/IEC 23000-4 (Musical Slide Show MAF 2nd Ed.)
N7247
N9687
N9037
N9843
A
4
ISO/IEC 23000-4 AMD 1 (MSS AF Conf. & Ref. SW)
A
6
ISO/IEC 23000-6 (Professional AF)
A
A
7
7
ISO/IEC 23000-7 (Open Access MAF)
ISO/IEC 23000-7 AMD 1 (OA AF Conf. & Ref. SW)
A
8
ISO/IEC 23000-8 (Portabe Video AF)
N1026
7
N1026
9
N9698
N1027
4
N9853
A
9
ISO/IEC 23000-9 (Digital Multi. Broadcasting MAF)
N9397
A
9
N9854
A
10
ISO/IEC 23000-9/Cor.1 (Digital Multi. Broadcasting
MAF)
ISO/IEC 23000-10 (Video Surveillance AF)
A
11
ISO/IEC 23000-11 (Stereoscopic Video AF)
B
B
1
1
B
1
B
1
ISO/IEC 23001-1 (XML Binary Format)
ISO/IEC 23001-1/Cor.1 (Misc. Editorial and technical
clar.)
ISO/IEC 23001-1/Cor.2 (Misc. Editorial and technical
clar.)
ISO/IEC 23001-1/Amd.1 (Reference Soft. & Conf.)
B
1
ISO/IEC 23001-1/Amd.1 (Exten. On encoding of wild
21
A
N1027
9
N1028
3
N7597
N8680
N9049
N8886
N9296
94
05/04 Busan
08/01 Antalya
07/04 San Jose
08/04
Archamps
08/10 Busan
FDIS
FDAM
FDIS
FDIS
ITTF
ITTF
ITTF
ITTF
FDAM
ITTF
TBP
FDIS
ITTF
TBP
FDIS
FDAM
ITTF
ITTF
TBP
TBP
08/04
Archamps
07/10
Shenzhen
08/04
Archamps
08/10 Busan
FDIS
ITTF
TBP
FDIS
ITTF
TBP
COR
ITTF
TBP
FDIS
ITTF
TBP
08/10 Busan
FDIS
ITTF
TBP
05/10 Nice
06/10
Hangzhou
07/04 San Jose
FDIS
COR
ITTF
ITTF
TBP
N/A
COR
ITTF
N/A
FDAM
ITTF
N/A
PDAM
ITTF
08/10 Busan
08/01 Antalya
08/10 Busan
07/01
Marrakech
07/07
FDIS 05/04/22
to be published
TBP
TBP
TBP
TBP
N/A
E
2
3
1
cards)
ISO/IEC 23001-2 (Fragment Request Unit)
ISO/IEC 23001-3 (IPMP XML Messages)
ISO/IEC 23008-1 Architecture
N9051
N9416
N8892
E
2
ISO/IEC 23008-2 Multimedia API
N8893
E
3
ISO/IEC 23008-3 Component Model
N8894
E
4
ISO/IEC 23008-4 Ressource & Quality Management
N8895
E
5
6
7
1
ISO/IEC 23008-5 Component Download
ISO/IEC 23008-6 Fault Management
ISO/IEC 23008-7 System Integrity Management
ISO/IEC 29116 Media Streaming MAF Protocols
N9053
N9054
N9055
N9420
B
B
E
E
29116
95
Lausanne
07/04 San Jose
07/04 San Jose
07/01
Marrakech
07/01
Marrakech
07/01
Marrakech
07/01
Marrakech
07/04 San Jose
07/04 San Jose
07/04 San Jose
07/10
Shenzhen
FDIS
FDIS
FDAM
ITTF
ITTF
ITTF
TBP
TBP
N/A
FDAM
ITTF
N/A
FDAM
ITTF
N/A
FDAM
ITTF
N/A
FDAM
FDAM
FDAM
FDAM
ITTF
ITTF
ITTF
ITTF
N/A
N/A
N/A
N/A
18
Resolutions of Systems
Cf. WG11 resolution.
96
Annex G – Video report
Sources: Jens-Rainer Ohm, Gary Sullivan, Paul Brasnett, Euee S. Jang
1 Development of AVC
The video subgroup jointly approved the output documents relating to ISO/IEC approval process
milestones that were produced during the 30th JVT meeting which was held in Geneva (2009-0130/02-03. Important work items in this context were
– Work on MVC software and conformance (both reaching FPDAM at this meeting)
– Work towards a new corrigendum (defect report issued on miscellaneous issues related to
MVC and SVC
– Work towards a new amendment containing Constrained Baseline Profile and a new SEI
message defining various interleaving methods for left/right stereo views in a conventional
(monoscopic) video (reaching FPDAM at this meeting).
Further discussion was performed jointly with the Requirements subgroup on possible definition of
a new profile that would allow usage of MVC for stereo video captured by interlaced cameras or
encoded in interlaced mode, as requested by the National Bodies of Japan, Singapore and the U.S.
This topic had also been discussed in the 30th JVT meeting, and the JVT had issued a "profile under
consideration" description document on the subject. The current multiview high profile does not
support so-called interlaced coding tools. It was asserted by some participants that there is need for
such support because interlaced video camera capture will still continue to be used, including for
stereoscopic applications. Technically, it is certainly feasible to include interlaced coding tools such
as MBAFF, because MVC does not change the AVC operation below the slice header level. Orally,
the companies Panasonic, Motorola and Mitsubishi expressed support for such a possible profile.
To guarantee a bug-free definition, it will be necessary to implement software. This is expected to
be submitted by proponents by the April 2009 meeting, such that a PDAM could be started by that
time. A WD was issued on this to allow further study.
Documents reviewed:
m15976
m15981
m15986
m15998
m16002
M16022
M16032
M16071
Table of Replies on ISO/IEC 14496-5:2001/FDAM 18 [SC 29 N 9928]
Summary of Voting on ISO/IEC 14496-5:2001/PDAM 15
Table of Replies on ISO/IEC 14496-4:2004/FDAM 30
Summary of Voting on ISO/IEC 14496-4:2004/PDAM 38
Summary of Voting on ISO/IEC 14496-10:200X/PDAM 1
JNB comment on the resolution 3.5.4
SGNB Comments on Multiview Video Coding Profile
Response to resolution 3.5.4 of 86-th WG 11 meeting
ITTF via SC 29 Secretariat
SC 29 Secretariat
ITTF via SC 29 Secretariat
SC 29 Secretariat
SC 29 Secretariat
Japan National Body
Singapore National Body
Andy Tescher for USNB
Documents approved:
No.
Title
10337 Disposition of Comments on ISO/IEC 14496-4:2004/PDAM 38
10338 Text of ISO/IEC 14496-4:2004/FPDAM 38 Multiview Video
Coding Conformance Testing
10339 Disposition of Comments on ISO/IEC 14496-5:2001/PDAM 15
10340 Text of ISO/IEC 14496-5:2001/FPDAM 15 Reference Software for
Multiview Video Coding
10341 Disposition of Comments on ISO/IEC 14496-10:200X/PDAM 1
97
TBP Available
N
09/02/06
N
09/02/20
N
N
09/02/06
09/02/20
N
09/02/06
10342
10343
10344
Text of ISO/IEC 14496-10:200X/FPDAM 1 Constrained Baseline
Profile and supplemental enhancement information
Defect Report on ISO/IEC 14496-10:200X
Working Draft 1 of ISO/IEC 14496-10:200X/Amd.2 Multiview
Field High Profile
N
09/02/20
N
Y
09/02/06
09/02/06
2 MPEG-7 Visual and Photo Player MAF
2.1 MPEG-7 Visual related work
The MPEG-7 breakout group was active during the whole week. Input documents related to the
Visual part in 15938-3 and Photo Player MAF 23003-3 are listed in the table below.
m16112
m16113
Proposal on removing comparison pair in comparison list for
independence test
Ground true table & incorrect video query clips of AVC, CC
m16171
Response to the Call for Proposals on Video Signature Tools
m16172
Response to the Call for Proposals on Video Signature Tools
m16178
Received responses to CfP on Video Signature Tools
Sangil Na
DongSeok Jeong
WonGeun Oh
JuKyong Jin
Kota Iwamoto
Ryoma Oami
Paul Brasnett
Stavros Paschalakis
Miroslaw Bober
Jens-Rainer Ohm
For the Photo Player MAF, the conformance testing amendment was prepared (PDAM2).
The main activity during the week was the evaluation of the responses received after the Call for
Proposals on Video Signature Tools, and preparation of the related Core experiments. A summary
of the responses received is given in the subsequent subsections.
2.1.1
Proposal 1: Video signature based on feature difference between various
pairs of sub-regions (NEC)
Frame signature representing relations between values of various pairs of sub-regions
 Mean Success Ratio = 93.24% (92.32%)(<5ppm)
 More than 93% SR for all modifications at “light” level for direct and partial matching of
D=10sec
 Descriptor size = 14.64 kbps
 Matching speed = 1,774 clips/sec (Partial Matching)
dimension 1
dimension 2
dimension 3
dimension 4
dimension 5
dimension 6
dimension 7
dimension 8
dimension 9
dimension 10
1. Resizing (to 320x240)
2. Mean value calculation of sub-regions
98
3. Comparison & Quantization
4. Encoding
frame #1
header frame signature
(60 bytes)
frame #2
confidence
(1 byte)
frame signature
(60 bytes)
confidence
(1 byte)
…
sequence1
sequence 2
Strengths
 Highest reported performance.
 Only complete set of results.
 Consistent performance across all
modifications.
 Ternary values – concept of zero
– additional robustness.
Weaknesses
 Descriptor training – part of test
data used, overfitting? Some test
conditions used, overfitted?
 Assumes framerate.
 Potentially fragile regions.
2.1.2
Proposal 2: Video Signature based on Video Tomography (Florida
Atlantic University)
Extraction: On Intel Core 2, 2.4 GHz, 65 milli sec for 180 frames
Descriptor Size: 6.144 Kbps
The proposed approach is based on video tomography - NTT Research Labs, 1994
A tomographic image is created from a set of video frames, e.g, a shot in a movie
Captures spatio-temporal changed in a shot.
FRAME I + 2
FRAME
Frame
i I+1
HT
TH[I][J] = FI[HT][J]
0≤J<W
0≤I<S
3
FRAME I
H/2
S
VIDEO TOMOGRAPH
H - HT
WT
W - WT
5
W
TV[I][J] = FI[J][WT]
0≤J<H
0≤I<S
S
H
Edge Map of Tomography
99
4
1
2
6
•
•
360 pixels are samples along each line
Only luminance is used
Video Clip
Pattern 1
Tomograph
Pattern 2
Tomograph
Pattern 3
Tomograph
Pattern 4
Tomograph
Pattern 5
Tomograph
Pattern 6
Tomograph
Edge
Detection
Edge
Detection
Edge
Detection
Edge
Detection
Edge
Detection
Edge
Detection
Composite Edge Image
Composite Edge Image
Composite Edge Image
16-byte Sub-Signature
16-byte Sub-Signature
16-byte Sub-Signature
48-byte Signature
•
Matching in two stages:
– Matching closest shot (segment) using shot level signatures
– Matching closest frame using frame signatures
Strengths
Weaknesses
 Exploits temporal information.
 Incomplete performance results.
 Efficient descriptor extraction.
 Limited region of frame
considered – may be simple to
 Initial results suggest good
attack?
robustness – CIVR 2007 dataset.
 Assumes frame rate.
2.1.3
Proposal 3: Video Signature based on Multi-resolution decomposition
(Mitsubishi Electric)
Descriptor Extraction: ~1 ms per frame
Descriptor Size: 8.96 kbps
Matching: ~2,500 per second
Detection: ~ 75.21% for 5 s at 5 ppm - Direct matching
 A frame is divided into 2x2 neighbourhoods
 In each 2x2 neighbourhood, the relation between the descriptor elements {a’, b’, c’, d’} and
the pixel values {a, b, c, d} of the neighbourhood is given by (1)-(4)




a’ = (a+b+c+d)/4
(1)
b’ = (a-b)/2
(2)
c’= (b-d)/2
(3)
d’ = (d-c)/2
(4)
Multi-resolution scheme – applied at 4 scales 2x2, 4x4, 8x8 and 16x16
Element binarisation by keeping MSB
A word is formed by projecting descriptor to lower dimension space
Multiple projections → multiple words, each from a different vocabulary
100
~
di
~
di
bits at 2x2
resolution
~
di
bits at 4x4
resolution
~
di
bits at 8x8
resolution
bits at 16x16
resolution
0
1
4
8
5
11
20
36
21
39
22
42
23
45
84
3
2
10
9
13
12
38
37
41
40
44
43
47
46
150 149 153 152 156 155 159 158 162 161 165 164 168 167 171 170
148
6
14
7
17
24
48
25
51
26
54
27
57
92
16
15
19
18
50
49
53
52
56
55
59
58
174 173 177 176 180 179 183 182 186 185 189 188 192 191 195 194
28
60
29
63
30
66
31
69
100 196 101 199 102 202 103 205 104 208 105 211 106 214 107 217
62
61
65
64
68
67
71
70
198 197 201 200 204 203 207 206 210 209 213 212 216 215 219 218
32
72
33
75
34
78
35
81
108 220 109 223 110 226 111 229 112 232 113 235 114 238 115 241
74
73
77
76
80
79
83
82
222 221 225 224 228 227 231 230 234 233 237 236 240 239 243 242
172
85
93
151
175
86
94
154
178
87
95
157
181
88
96
160
184
89
97
163
187
90
98
166
190
91
99
169
193
116 244 117 247 118 250 119 253 120 256 121 259 122 262 123 265
Word 1
Word 2
246 245 249 248 252 251 255 254 258 257 261 260 264 263 267 266
Word 3
Word 4
124 268 125 271 126 274 127 277 128 280 129 283 130 286 131 289
Word 5
270 269 273 272 276 275 279 278 282 281 285 284 288 287 291 290
132 292 133 295 134 298 135 301 136 304 137 307 138 310 139 313
In each 2x2 neighbourhood, the relation between the
~
d ielements and the pixel values
294 293 297 296 300 299 303 302 306 305 309 308 312 311 315 314
a, b, c, d of the neighbourhood is given by eq. (1)-(4), i.e
140 316 141 319 142 322 143 325 144 328 145 331 146 334 147 337
318 317 321 320 324 323 327 326 330 329 333 332 336 335 339 338



a
b
(a+b+
(a-b)/2
c+d)/4
c
d
(d-c)/2 (b-d)/2
1 stage matching – Word occurrence
2nd stage matching – Temporal ordering of words
3rd stage matching – full descriptor matching
 Frame Rate Ratio & Frame Displacement estimation
Strengths
Weaknesses
 Efficient descriptor extraction.
 No partial matching results.
 Multi-scale representation
 Current bit selection of the words
provides robustness
could impact performance.
 Does not assume frame rate.
 Weakness on frame rate reduction
and camera capturing.
 Efficient search with words.
st
2.1.4
N.B. No assumption of frame rate makes matching problem harder for all
conditions.
2.1.5
Proposal 4: Hierarchical video signature description using existing
MPEG-7 features and motion activity (University of Brescia)
Descriptor Size: Dependent on descriptors used e.g. 4.96 kbps for 4 proposed descriptors
Extraction & Matching Complexity: Dependent on descriptors used




Description Scheme
Hierarchical Temporal Structure
Flexibility
Possibility to choose and use many descriptors e.g.
 Standard - Dominant Color, Color Layout
 Non standard - Motion Activity Map (MAM), Direction of Motion Activity (DMA)
header
101
data
VS:

Matching
o The proposed method is exhaustive
o To speed up the comparison
 Pre-processing: clustering of D associated to each segment
Strengths
 Flexibility in implementation and
use of descriptors.
 Hierarchical Temporal Structure –
supports granular searching.
 Schema is compatible with
existing MPEG-7 standard.
2.1.6
Weaknesses
 Incomplete performance results.
 Possibility of no common
descriptors between sequences →
not possible to judge match.
 Variable extraction cost.
Proposal 5: Video signature based on saliency map (Peking University)
Descriptor Size: 16bits/frame, 10FPS → 0.160 kbps
Independence rates: 8.65% for direct 2 seconds condition, 2.05% for direct 5 seconds, 0.5% for
direct 10 seconds condition, 0.02% for partial condition (30 seconds)
Robustness: from 80-99% @ ~10,000 ppm


Saliency map – combines colour, intensity & orientation information.
Saliency map simulates the attention model of the human visual system.
Extraction Process
Input video frames
Pre-processing
Partition the saliency
map to M×N blocks
Strengths
 Very compact descriptor.
 Very fast matching.
 HVS model may provide
robustness.
Extract visual
features maps
Sum saliency values
within each block, and
adaptively threshold
Construct a
saliency map
Video fingerprints
Weaknesses
 Sensitive to text/logo overlay &
widescreen bars etc.
 High complexity extraction.
 High false alarm rates.
102
2.2 Conclusions and next actions
With the current information, comparison between the methods that were submitted as responses to
the CfP is difficult. Only one proposal came with complete results, while all others came with
incomplete results due to late availability of the extremely large data set. For the one proposal with
complete results, it was not fully clear by how much the fact that queries were partially used to train
the descriptor would affect the performance.
Further, there were some small problems with the ground truth definition, and it was only detected
during the experimentation that some of the data contain black sequences which could bias the
results both in correct and false matching. Therefore, it was decided to not decide for one particular
algorithm by adoption into the XM, but rather continue with core experiments, using corrected
testing conditions, such that it will be possible to achieve a more clear evidence about the benefits
of the various technologies by the next meeting (see N10345).
Some issues were further discussed, but for the current time were not taken up into the CE
definition (may be in future CEs):
 New robustness conditions needed (instead of consecutive independence/robustness tests
which give only one data point on the RoC for precision/ recall)?
 Matching scenario used (only a→a’ or also a→a’…z’).
 Localisation accuracy? (1s – can this be relaxed in some scenarios?)
 Scalability & complexity of querying large datasets – specific CE needed?
2.3 Output documents related to MPEG-7 Visual
No.
10345
10346
10347
Title
15938-3 Visual
Description of Core Experiments in Video Signature Description
development
23000-3 Photo Player Application Format
Request for ISO/IEC 23000-3/Amd.2
Text of ISO/IEC 23000-3/PDAM2 Conformance Testing for
Photo Player MAF
TBP Available
N
09/02/06
N
N
09/02/06
09/02/06
3 23002 MPEG-C Video Technologies
3.1 23001-4 and 23002-4 Reconfigurable Video Coding (RVC)
3.1.1
General status of work
The two parts related to RVC (ISO/IEC 23001-4 Codec Configuration Representation in MPEG-B
and ISO/IEC FCD 23002-4 Video Tool Library in MPEG-C) were progressed into FDIS status, and
most of the work during the week was dedicated to this.
• In ISO/IEC FDIS 23001-4 Codec Configuration Representation (N10349), mainly the parts
describing languages and their usage (FNL, BSDL) have been enhanced.
• In ISO/IEC FDIS 23002-4 Video Tool Library (N10351), lots of detailed changes were
made, in particular more precise description of FUs, aligning text with software etc. was
provided.
103
No fundamental technical issues have been changed relative to the “Study of FPDAM” texts.
However, a serious check of the current software status has indicated that MPEG-2 main profile is
not fully supported yet. AVC baseline profile is supported, but currently the parser is not fully
automatically generated from BSDL (instead, it is a hand-coded functional unit). There is however
no doubt that fully automatic generation of the parser is possible in principle, and this
implementation will not affect the description in the FDIS text.
Whereas the text of the 23002-4 standard is relevant to describe the current content of the video tool
library, it does not give a normative reference to FU behaviour. To make the specification
“complete” with this regard, amendment 1 (software and conformance related to the suite of tools in
the current standard text) was issued. As decided by the October 2008 meeting, the normative
reference of input/output behaviour of FUs will be described by the input/output of the related
software module(s). It is clear, however, that different implementations could be made in principle
with identical I/O behaviour; the software modules as such are therefore not normative.
It was further discussed and clarified that RVC is still in "phase 1", which is re-implementation of
existing decoder conformance points
• MPEG-4 SP and MPEG-4 AVC CBP currently
• MPEG-2 MP, MPEG-4 ASP, AVC HP and SVC to follow soon in Amd.2.
"Phase 2" could open up more options that were previously discussed in the RVC context, in
particular
• Possible simplification of standards development by adding new FUs. Whereas this
approach sounds attractive, it appears clear that a real simplification (by adding / exchanging
single FU modules) would only be given for cases where a preceding standard is fully
implemented in RVC schema – it needs to be clarified whether RVC-based encoders are
necessary in this context as well.
• downloadable (on the fly) decoder solutions
More of this will be discussed and provided in a future update of the vision document, for which not
sufficient time was available during the Lausanne meeting.
3.2 Assignment of editors
Documents
FDIS of 23001-4 (MPEG-B CCR)
FDIS of 23002-4 (MPEG-C VTL)
Workplan
Conformance & RSM WD
Extension to VTL
RVC Vision
RVC Core Experiments
Editors
Gwo Giun
Hwa Seon
Marco
Gwo Giun, Christophe
Mikael
Euee
Ihab
3.3 Allocation of input contributions
 MPEG-B FDIS Preparation
Doc.
Category
Title
No.
Authors
MPEG-B
m16076
Video
Hyungyu Kim, Sinwook Lee, Hwa
Seon Shin, Sowon Kim, Minsoo
Park, Euee S. Jang
Comments on ISO/IEC 23001-4
FCD 2
104
Recommendation
m16145

Matthieu Wipliez
Proposed changes for RVC-CAL
Mickael Raulet
annex A of ISO-IEC 23001-4
Jean-François Nezan
All
General/All
 MPEG-C FDIS preparation
Doc.
Category
Title
Authors
No.
MPEG-C
An MPEG Fixed Point IDCT
Ihab Amer, Marco Mattavelli
Video
Module for the RVC VTL
m16164
 Fixed point 8x8 IDCT cal module is proposed with an
Recommendation
enhanced throughput capability
 [Recommendation] To put into RSM
MPEG-C
Editor's Input on Study Text of
Yi-Shin Tung, Hwa Seon Shin
ISO/IEC FCD 23002-4
m16030 Video
Recommendation

Hwa Seon Shin, Sowon Kim,
MPEG-C
Revised FU Network and
m16031
Minsoo Park, Byeongho Choi,
Video
Tokens for MPEG-4 SP
Chungku Yie, Euee S. Jang
Recommendation
 Discussed during the FDIS preparation.
 RVC Vision
Doc.
Category
No.
MPEG-B
m16077
Video
Title
Authors
Update proposal on the Vision of
RVC
Hyungyu Kim, Sinwook Lee, Euee
S. Jang
 2nd Phase Work
m16033
MPEG-C
Video
Summary &
Recommendation
m16136
MPEG-C
Video
Summary &
Recommendation
m16137
MPEG-C
Video
Summary &
Recommendation
m16138
MPEG-C
Video
Summary &
Recommendation
Functional unit of AVC
Gwo Giun Lee, Jia-wei Liang,
deblocking filter with
He-Yuan Lin, Ming-Jiun Wang
MBAFF
 Deblocking filter FU for AVC is proposed.
 Accepted the FU in VTL extension and RSM.
An AVC Entropy Coding
Hussein Aman-Allah, Ihab Amer,
Module for the MPEG RVC
Marco Mattavelli
VTL
 Entropy coding FU for encoding tools is proposed.
 To start a CE 3 on developing encoding tools.
An AVC Motion estimation
Ehab Asaad Hanna, Ihab Amer,
Module for the MPEG RVC
Marco Mattavelli
VTL
 ME FU for encoding tools is proposed.
 To start a CE 3 on developing encoding tools.
An AVC Intra Prediction
Karim Maarouf, Ihab Amer,
Module for the MPEG RVC
Marco Mattavelli
VTL
 Intra prediction FU for encoding tools is proposed.
 To start a CE 3 on developing encoding tools.
105
Other Documents reviewed:
m15947
Ad Hoc Group on Reconfigurable Video Coding
m15781
m15782
Summary of Voting on ISO/IEC FCD 23001-4
Summary of Voting on ISO/IEC FCD 23002-4
Euee S. Jang, Marco Mattavelli,
Kazuo Sugimoto
SC 29 Secretariat
SC 29 Secretariat
Output Documents:
No.
10348
10349
10350
10351
10352
10353
10354
10355
10356
Title
23001-4 Codec Configuration Representation
Disposition of Comments on ISO/IEC FCD 23001-4
Text of ISO/IEC FDIS 23001-4 Codec Configuration
Representation
23002-4 Video Tool Library
Disposition of Comments on ISO/IEC FCD 23002-4
Text of ISO/IEC FDIS 23002-4 Video Tool Library
Request for ISO/IEC 23002-4/Amd.1
Text of ISO/IEC 23002-4/PDAM1 Video Tool Library
Conformance and Reference Software
WD 4 of ISO/IEC 23002-4/Amd.2 (Tools for MPEG-2 MP,
MPEG-4 ASP, AVC HP and SVC)
RVC Work Plan and FU Development Status
Description of Core Experiments in RVC
TBP Available
N
N
09/02/06
09/03/31
N
N
N
N
09/02/06
09/03/31
09/02/06
09/02/25
N
09/02/13
N
N
09/02/06
09/02/06
4 Explorations – 3D Video
The goal of 3D video, as a first step towards a broader range of free-viewpoint (FTV) applications,
is to generate interpolated views from available videos of multiview camera configurations. The
target application is mostly seen for upcoming generations of (auto-) stereoscopic displays, for
which only a low number (1) of video sequences will be transmitted, but rendering of additional
views will be enabled by associated depth information. The main achievements in this activity have
been review of technical developments from the exploration experiments, further clarification about
the vision, applications and requirements, and planning of next steps.
The vision is definition of a new 3D Video (3DV) format that goes beyond the capabilities of
existing standards to enable both advanced stereoscopic display processing and improved support
for auto-stereoscopic N-view displays, while enabling interoperable 3D services. This is illustrated
in the following Figure. It is assumed that only limited camera inputs and constrained rate
transmission would be available according to a distribution environment. The 3DV data format aims
to be capable of rendering a large number of output views for auto-stereoscopic N-view displays
and support advanced stereoscopic processing.
106
Stereoscopic displays
• Variable stereo baseline
• Adjust depth perception
Left
Right
Limited
Camera
Inputs
Data
Format
Data
Format
Constrained Rate
(based on distribution)
Auto-stereoscopic
N-view displays
• Wide viewing angle
• Large number of
output views
Compared to the existing coding formats, the 3DV format would have several advantages in terms
of bit rate and 3D rendering capabilities, which is illustrated in the Figure below:

2D+Depth, as specified by MPEG-C Part 3, supports the inclusion of depth for generation of
an increased number of views. While it has the advantage of being backward compatible
with legacy devices and is agnostic of coding formats, it is only capable of rendering a
limited depth range since it does not directly handle occlusions. The 3DV format expects to
enhance the 3D rendering capabilities beyond this format.

Multiview Video Coding (MVC) supports the direct coding of multiple views and exploits
inter-camera redundancy to reduce the bit rate. Although MVC is more efficient than
simulcast, the rate of MVC encoded video is proportional to the number of views. The 3DV
format is expected to significantly reduce the bit rate needed to generate the required views
at the receiver.
Bit Rate
Simulcast
3DV should be compatible with:
• existing standards
• mono and stereo devices
• existing or planned infrastructure
MVC
3DV
2D+Depth
2D
3D Rendering Capability
To make a 3D video system operational, it is necessary to provide view synthesis (interpolation)
with sufficient quality. To achieve this, and produce anchor encodings for depth maps and video
data based on currently available compression technology (AVC / MVC / MPEG-C part 3) is the
main objective of current exploration experiments. Judging of results was performed by experts
107
viewing using stereoscopic and autostereoscopic displays. The main conclusions drawn are the
following:
– Good progress has been made in view synthesis, appropriate methods for hole filling and depthdiscontinuity boundary processing have been implemented.
– Depth estimation was also improved (e.g. by implementing temporal consistency and spatial
smoothing), and now gives better results for sequences that were of unacceptable quality before
(e.g. Champagne Tower); however, apparently the generated depth maps are still considerably
wrong, in particular in cases of small structures, larger depth ranges and more complicated
occlusion areas (e.g. the Newspaper and Leaving Laptop sequences).
– It was discussed that the unavailability of reliable depth maps currently is the main factor that
delays the development. To make progress, it was decided to consider using hand-tuned or
semi-automatically generated depth maps, in particular for the more challenging sequences. A
Call for depth maps and supplementary information (e.g. segmentation masks) was therefore
issued (N10359).
Exploration Experiments in 3D Video Coding are continued as follows (N10360):
• EE1: Improvement of depth estimation
– Including one semi-automatic method
• EE2: Improvement of view interpolation
– Including combination of "best known" methods
• EE3: Investigation of alternative method for representation: Layered depth video (LDV)
Usage of hand-tuned depth maps appears also realistic for application scenarios of 3D Video, in
particular could such data be provided during the production phase (not for real-time applications).
Once the data are available and proven to provide sufficient quality in combination with the
available view synthesis, the next step towards a CfP will be made by defining rate points and
coding conditions for the anchors. The previous EE4, which was related to first findings about
coding of video with associated depth maps, was judged to have been not suitable for useful
conclusions, because the currently available quality of depth maps could have significant effects on
the necessary data rate, and the quality seemed to be more affected by the defects in the original
depth maps themselves than by deviations introduced by coding.
Documents reviewed in AHG (see AHG report)
m16021
m16026
Depth Map Compression for View Synthesis in FTV
Results of 3DV/FTV Exploration Experiments, described in w10173,
for Alt Moabit sequence.
m16027
m16034
Analysis of sub-pixel precision in Depth Estimation Reference
Software and View Synthesis Reference Software
Application of Middle Level Hypothesis algorithm for improvement of
depth maps produced by Depth Estimation Reference Software.
Basic LDV view-synthesis/renderer SW : LDVS
m16040
LDV Virtual View Rendering Software
m16041
Temporal Improvement Method in View Synthesis
m16042
3DV EE3 Report on Champagne_tower Sequences
m16043
3DV EE4 Report on Dog Sequences
m16046
3DV EE3 results on Dog sequence
m16028
108
Gangyi Jiang
Olgierd Stankiewicz
Krzysztof Wegner
Krzysztof Klimaszewski
Krzysztof Wegner
Olgierd Stankiewicz
Olgierd Stankiewicz
Krzysztof Wegner
Fons Bruls
Lincoln Lobo
Yin Zhao
Lu Yu
Yin Zhao
Deliang Fu
Lu Yu
Fons Bruls
Lincoln Lobo
Yin Zhao
Deliang Fu
Lu Yu
Lianhuan Xiong
Yin Zhao
Deliang Fu
Lu Yu
Yin Zhao
Lu Yu
Carmen CHENG
Yan HUO
m16047
3DV EE4 results on Dog sequence
m16048
Depth Estimation Improvement for Depth Discontinuity Areas and
Temporal Consistency Preserving
m16049
3DV/FTV EE3/EE4 Results on Alt Moabit sequence
m16050
Depth Map Coding Quality Analysis for View Synthesis
m16053
3DTV Exploration Experiments on Pantomime sequence
m16059
Results of Exploration Experiments in 3D Video for Lovebird2
m16060
EE1: Depth Estimation Results on 'Pantomime? Sequence
m16061
EE2: View Synthesis Results on 'Pantomime? Sequence
m16062
EE4: Coding Results on 'Pantomime? Sequence
m16063
Experimental Results on Improved Temporal Consistency
Enhancement
m16064
Implementation of Boundary Noise Removal for View Synthesis
m16066
Report of 3DV/FTV Exploration E xperiments with Champagne
Tower
m16067
3DV/FTV EE results of Depth Estimantion and View Synthesis on
"lovebird1" sequence
m16068
3DV/FTV EE4 result of Coding Experiment on "Dog" sequence
m16070
The consideration of the imrpoved depth estimation algorithm
m16087
3DV/FTV EE3 : LeavingLaptop and Lovebird1
m16088
3DV/FTV EE4 : Dog sequence
m16090
View Synthesis Algorithm in View Synthesis Reference Software 2.0
(VSRS2.0)
m16091
View Synthesis Method without Blending
m16092
Depth Estimation Reference Software (DERS) with Image
Segmentation and Block Matching
m16094
Results of 3D Video Coding Experiments EE1 and EE2 for Dog Data
Set
3DV/FTV EE Report on Doorflower sequence
m16101
109
Yu LIU
Carmen CHENG
Yan HUO
Yu LIU
Hui Yuan
Yilin Chang
Haitao Yang
Xiaoxian Liu
Sixin Lin
Lianhuan Xiong
Xiaoxian Liu
Yingying Guo
Haitao Yang
Junyan Huo
Yilin Chang
Sixin Lin
Lianhuan Xiong
Siping Tao
Ying Chen
Miska M. Hannuksela
Houqiang Li
Ivana Radulovic
Per Fröjdh
Sehoon Yea
Zafer Arican
Anthony Vetro
Cheon Lee
Yo-Sung Ho
Cheon Lee
Yo-Sung Ho
Cheon Lee
Yo-Sung Ho
Sang-Beom Lee
Cheon Lee
Yo-Sung Ho
Cheon Lee
Yo-Sung Ho
Takanori Senoh
Kenji Yamamoto
Ryutaro Oi
Tomoyuki Mishina
Makoto Okui
Gun Bang
Gi Mun Um
Namho Hur
Jinwoong Kim
Gun Bang
Gwang sin Cho
Namho Hur
Jinwoong Kim
Donggyu Sim
Gun Bang
Jaeho Lee
Namho Hur
Jinwoong Kim
Patrick Lopez
Dong Tian
Patrick Lopez
Dong Tian
Masayuki Tanimoto
Toshiaki Fujii
Kazuyoshi Suzuki
Masayuki Tanimoto
Toshiaki Fujii
Kazuyoshi Suzuki
Masayuki Tanimoto
Toshiaki Fujii
Kazuyoshi Suzuki
Mejdi Trimeche
Miska M Hannuksela
Shinya Shimizu
m16129
Results of Exploration Experiments in 3D Video Coding for Dog Data
Set
m16135
Philips 3DV EE2,EE4 results
m16139
Philips 3DV EE1,2,3,4 results
m16175
3DV EE1 & EE2 on Leaving_Laptop and Improvements in ViSBD 2.1
m16189
3DV EE1 and EE2 Results on Newspaper Sequence
m16190
3DV EE4 Results on Pantomime Sequence
Hideaki Kimata
Y. Wang
K. Müller
P. Merkle
A. Smolic
Fons Bruls
Lincoln Lobo
Fons Bruls
Lincoln Lobo
Dong Tian
Po-Lin Lai
Patrick Lopez
Jaewon Sung
Yong-Joon Jeon
Byeong-Moon Jeon
Jaewon Sung
Yong-Joon Jeon
Byeong-Moon Jeon
Documents reviewed in Video
m16065
m16093
m16130
Additional Test Sequence for 3D Video
Propose test sequence „study“ instead of „newspaper“ (similar scene, but
without large depth contrast at the newspaper).
No synthesis results provided, but proponents say that results are better.
Question to be discussed: Is it appropriate to replace „difficult“ sequences?
3DV must work for any. Thank for the contribution.
Data Format for FTV
"FDU" (FTV data unit) consists of depth data and synthesis error for
sythesized views relative to the center view.
This is rather an alternative representation method than a data format.
It is claimed that the residual for the views in between center and left/right
view is correlated with the residual of the left/right views, and therefore can be
beneficial for synthesis. Could also be beneficial for coding when a wider
range of views is required. All this is still to be proven. No action to be taken
at this point.
Considerations about the vision on 3D Video
Enable 3D video on 3D displays such that observer gets a good depth
impression. Both for stereoscopic and multi-view (autostereoscopic) displays.
Cheon Lee
Jae-Il Jung
Yun-Suk Kang
Yo-Sung Ho
Masayuki Tanimoto
Toshiaki Fujii
Kazuyoshi Suzuki
Aljoscha Smolic
Karsten Mueller,
Peter Kauff, Thomas
Wiegand
Compatibility with mono and stereo (i.e. that the uncompressed format
includes a mono or stereo view) as „may“
Complete 3DV solutions should be compared and be judged by final outcome:
How good does it look on stereo and N-view displays?
Same input should be used for all proposals.
m16165
Reference solution should be developed by MPEG
On addressing market 3D developments, Stereo & MPEG 3DV activity.
Lack of one clear recognized 3D standard – this could cause confusion in the
market. Applications could e.g. include autostereoscopic, variable-stereo
Fons Bruls
Lincoln Lobo
Wiebe de Haan
Typical requirements of view synthesis range is up to 4 baseline distances,
where "1 BD" would be the configuration of "best parallax distance" on a
stereo display. Currently, the experiments use up to 3BD.
Stereo baseline adjustment (for current stereo displays) could be a near-term
issue, before the market introduction of autostereoscopic displays.
Question is raised on whether the encoded format of 3DV should be display
unaware and backward compatible. This would be a big advantage compared
to current stereo, where everything (starting from camera settings) is often
fully tuned to one target display type.
Output documents:
No.
Title
TBP Available
110
10357 Vision on 3D Video Coding
10358 Applications and Requirements of 3D Video Coding
10359 Call for 3D Test Material: Depth Maps & Supplementary
Information
10360 Description of Exploration Experiments in 3D Video Coding
Y
N
Y
09/02/06
09/02/06
09/02/06
N
09/02/06
5 Explorations – High-Performance Video Coding
Following the workshop in Busan, the main focus of the HVC activity currently is to prepare a more
solid assessment of the availability of improved-compression technology, that would in particular
fulfil the needs of high-resolution, high-quality video applications. In Busan, a Call for test
materials had been issued, as it was recognized that the currently available materials would not
fulfil this purpose, in particular for HD and Ultra HD scenarios. The following responses were
received (see more detailed assessment in the AHG report M15950):
m16018
Response to call for test materials for
HVC study
m16035
Response to Call for Test Materials for
High-Performance Video Coding
Standards Development
Samsung response to Call for Test
Materials for MPEG HVC
standardization
m16052
m16212
M16219
BBC 1080p50 test materials for HVC
study
Status of potential test materials for
HVC with 4K or higher resolutions
Shun-ichi Sekiguchi
Yoshihisa Yamada
Yoshiaki Kato
Kohtaro Asai
Tokumichi Murakami
TK Tan
Yoshinori Suzuki
Woo-Jin Han
JeongHoon Park
IlKoo Kim
Tammy Lee
Thomas Davies
Kohtaro Asai, Ryuta Suzuki, Shun-ichi Sekiguchi
After extensive viewing sessions, a set of 4 sequences for the HD/UHD range of sizes (including
newly proposed and 2 from the available SVT set), and another 4 sequences for the WVGA range
(as expected to become important in mobile applications) were selected for the upcoming Call for
Evidence.
However, it was assessed that expecting responses to the Call by April would result in a very short
timeline and probably a lower number of responses. More specifically, after distributing and
converting the new test sequences, AVC anchor encoding that is necessary to define the rate points
would hardly be available before end of April. It was therefore decided to issue again a Draft Call
(N10363), substantially refined as compared to the initial version of the previous (Busan) meeting.
A final Call for Evidence is planned to be issued in April, with responses expected for July. Once
evidence would be available, a Call for proposals could be issued immediately, with possible
responses for January 2010. It must also be observed that certainly still better test material would be
desirable for a CfP and development of an HVC standard. Therefore, an updated Call for Test
materials was also issued (N10362). Further, the expert viewing to be performed in the context of
the CfE is challenging in terms of necessary testing equipment (in particular for UHD resolutions),
and must be carefully further explored and prepared.
Materials to be provided within the responses to the Call for Evidence will be decoded results,
bitstreams and a binary decoder. It will not be necessary to submit for all classes (ranging from
WQVGA up to UHD), but a submission must be complete for each class that is chosen.
111
The following input contributions, reporting about methods for improved compression, were
reviewed:
m16019
On coding efficiency with extended block size for UHDTV
Setting only IPPP: Usage of extended macroblock size 32x32: 7.8% BR
reduction for 1 4Kx2K, approx.4.5% for 1080p, 7.9% for
720p.Performance improvement mainlyby reduction of MV bits.
No experiments on B pictures yet.
m16069
Fast Decoder Side Motion Vector Derivation with Candidate Scaling for
Improving AVC Compression Performance
Additional MB types where the decoder can derive MVs. Encoder
decides which mode is used. In contrast to previous contribution from
Archamps, reduction of complexity is main target. Use left and top-right
neighbors as candidates. Options with or without sub-pel refinement. In
case of multi-hypothesis prediction, scaling of candidates is used.
Results are 9%/8% for HD, and 5.8%/5% for subpel/no subpel
refinement, respectively. When modified rounding (not standard AVC) is
used, gain is lowered. Results only for IPPP, but contributors expect gain
for B pictures as well.
Preliminary response for Draft Call for Evidence on High Performance
Video Coding
Extended macroblock size to 32x32 with same sub-partitions as in AVC.
IPPP coding structure. Use 8Kx4K sequences, but only cropped area of
size 1920x1080 out of these. Average around 15% over 6 sequences,
each with „best“and „worst“cropped area.
Second Order Prediction of Video Coding
Inter prediction followed by intra prediction (i.e. intra prediction of
residuals) as additional mode. To generate the residual at the boundary,
the MV of the current block is used (i.e. not the actual prediction signal of
the neighbored block). Bitrate reduction with CABAC around 4.5% similar
for IPPP, IBBP and IBbBP. Reduces to around 3% with WP on. CAVLC
slightly higher gains. Encoder complexity increase less than 10% on
average, decoder less than 4% average (considering runtime). Selection
of mode varies, but could be up to 40% for some cases.
Motion Vector Coding with Optimal Predictor
Extension of motion vector competition. Decoder could find the best MV
predictor among candidates. If certain conditions are violated, the
encoder signals that the median or another candidate shall be used.
Candidate is determined by template matching. Gain up to 6.8% BR
reduction for 720p, 3.6% for CIF. Interpretation: Number of skip modes is
increased. No investigation about error propagation of the scheme yet.
Temporal neighbor used as in MVC.
m16082
m16109
m16209
Shun-ichi Sekiguchi
Shuichi Yamagishi
Yoshihisa Yamada
Yoshiaki Kato
Kohtaro Asai
Tokumichi Murakami
Steffen Kamp
Mathias Wien
Tomonobu Yoshino
Sei Naito
Shigeyuki Sakazawa
Shangwen Li
Lu Yu
Lianhuan Xiong
Jungyoup Yang
Kwanghyun Won
Byeungwoo Jeon
Su Nyeon Kim
The following input documents were discussed jointly with the Requirements subgroup, and were
taken as an initial point for further updates of the vision, applications and requirements document
(N10361):
m16207
m16224
Requirements for high-performance video standards
Mobile devices should be capable to decode content from video databases
and home servers, which are going to 720p or 1080p. Main target of the
contribution is to include tradeoff between power efficiency / complexity
such as "half complexity as AVC gives 25% coding efficiency improvement,
while 2-3x complexity as AVC gives 50% coding efficiency". Could be
structured in different profiles. Also request shorter timeline for the lowcomplexity case.
Conclusion: Include relationship between complexity and compression
efficiency in A&R doc. Consider also relationship with delay.
Proposal on Focus for MPEG HVC standard development
Requirements can differ between different applications, e.g. mobile and
professional. "Most practical way to start with high-efficiency solution and
strip off tools to arrive at lower complexity". Complexity can hardly be
quantified. Propose onion-shaped profile structure.
Output documents:
No.
Title
Kemal Ugur
Justin Ridge
Ken McCann
Woo-Jin Han
Jason Suh
TBP Available
112
10361
10362
10363
Vision and Requirements for High-Performance Video Coding
(HVC)
Call for Test Materials for High-Performance Video Coding
Standards Development
Draft Call for Evidence on High-Performance Video Coding
113
Y
09/02/06
Y
09/02/06
N
09/02/06
Annex H – JVT report
Source: Jens Ohm and Gary Sullivan, Chairs
114
Annex I – Audio report
Source: Schuyler Quackenbush, Chair
Audio Subgroup Report for the 87th MPEG Meeting
Source: Schuyler Quackenbush, Chair, Audio Subgroup
1
2
Opening of the meeting ......................................................................................................... 117
Administrative matters .......................................................................................................... 117
2.1 Communications from the Chair
117
2.2 Approval of agenda and allocation of contributions 117
2.3 Creation of Task Groups
117
2.4 Approval of previous meeting report 117
2.5 Review of AHG reports
117
2.6 Joint meetings 117
2.7 Received National Body Comments and Liaison matters 117
2.8 Plenary Discussion 118
3 Record of AhG meetings ....................................................................................................... 118
3.1 AhG Meeting on USAC Sunday 1000-1700 118
4 Task group activities ............................................................................................................. 123
4.1 Joint meetings 123
4.1.1 MPEG Surround Signalling and MP4 FF issues (with Systems) ................................ 123
4.1.2 High-Performance Video Coding (HVC) and Audio (with Requirements) ................ 124
4.2 Task Group discussions
124
4.2.1 MPEG-2, MPEG-4, MPEG-7 Audio and MPEG Surround, conformance, reference
software ................................................................................................................................ 124
4.2.2 MPEG-D Spatial Audio Object Coding ...................................................................... 125
4.2.3 MPEG-D Unified Speech and Audio .......................................................................... 130
4.2.4 Exploration: Meta-Data ............................................................................................... 135
5 Audio closing plenary discussions ........................................................................................ 136
6 Meeting deliverables ............................................................................................................. 136
6.1 Responses to Liaison and NB comments
136
6.2 Recommendations for final plenary 136
6.3 Establishment of Ad-hoc Groups
136
6.4 Approval of output documents
137
6.5 Press statement
137
7 Future activities ..................................................................................................................... 137
7.1 Schedule of future meetings 137
7.2 Agenda for next meeting
137
7.3 All other business
137
7.4 Closing of the meeting
137
Annex A Participants ............................................................................................................... 138
Annex B Audio Contributions and Schedule .......................................................................... 139
115
Annex C
Annex D
Annex E
Task Groups ............................................................................................................. 144
Output Documents ................................................................................................... 145
Agenda for the 88th MPEG Audio Meeting ............................................................. 147
116
1
Opening of the meeting
The MPEG Audio Subgroup meeting was held during the 87th
meeting of WG11, February
2-6, 2009, Lausanne, CH. The list of participants is given in Annex A.
2
2.1
Administrative matters
Communications from the Chair
The Chair summarised the issues raised at the Sunday evening Chair’s meeting, proposed task groups for the week, and
proposed agenda items for discussion in Audio plenary.
2.2
Approval of agenda and allocation of contributions
The agenda and schedule for the meeting was discussed, edited and approved. It shows the
documents contributed to this meeting and presented to the Audio Subgroup, either in the task
groups or in Audio plenary. The Chair brought relevant documents from Requirements, Systems to
the attention of the group. It was revised in the course of the week to reflect the progress of the
meeting, and the final version is shown in Annex B.
2.3
Creation of Task Groups
Task groups were convened for the duration of the MPEG meeting, as shown in Annex C. Results
of task group activities are reported below.
2.4
Approval of previous meeting report
The 86th
2.5
Audio Subgroup meeting report was registered as a contribution, and was approved.
Review of AHG reports
There were no requests to review any of the AHG reports.
2.6
Joint meetings
The joint meetings with Audio for the week are shown below:
Groups
Audio, Systems
Audio, Req
2.7
What
File Format
Audio for HVC
Where
Audio
Audio
Day
Wed
Thu
Time
1400-1500
1500-1530
Received National Body Comments and Liaison matters
The NB Comments and Liaison documents for the meeting that require a response are as shown
below.
No.
m16044
From
AU NB
m16045
AU NB
m15911
m15879
m15914
M15908
CN NB
FI NB
FR NB
Liaison Statement from DRM
m15916
Liaison Statement from
"ETSI/EBU/CENELEC JTC on Broadcast"
Audio Liaison Statement from WorldDMB
Forum via SC 29 Secretariat
m15919
m15978
Liaison Statement from ITU-R SG 6 to SC
29/WG 11
Title
Comment on the unified speech and
audio coding activity
Comment on the exploration on
metadata driven post-processing of
audio
Comment on USAC
Comment on USAC
Comment on USAC
on 960 frame length in the MPEG-4
AAC family of profiles
on 960 frame length in the MPEG-4
AAC family of profiles
on Proposal to remove 960 transform
from the AAC, HE AAC and HE AAC
v2 profiles
on Extension of ITU-R BS.1387 and
Call for Proposal
117
Comment
(from 86th mtg)
(from 86th mtg)
(from 86th mtg)
(from 86th mtg)
(from 86th mtg)
(from 86th mtg)
Respond at next
meeting
m16004
IEC TC100/TA4
m16218
IEC TC100/TA4
2.8
IEC CDV 61937-11: Digital audio -Interface for non-linear PCM encoded
audio bitstreams applying IEC 60958 - Part 11: MPEG-4 AAC and its
extensions in LATM/LOAS
IEC CDV 60958-3/Amd.1
Plenary Discussion
It was communicated to the Chair via informal channels that the use of the complexity metrics PCU
and MCU in MPEG Audio specifications is not clear to technical experts from outside MPEG. The
Chair attempted to clarify this via informal channels, but noted that Audio experts might revise
documents on complexity to address this lack of clarity.
3
3.1
Record of AhG meetings
AhG Meeting on USAC Sunday 1000-1700
USAC Core Experiments
Taejin Lee, ETRI, presented
m16156
Progress of Technology Merge Between System 2 and
USAC RM
Taejin Lee
Max Neuendorf
Jeremie Lecomte
Kyeongok Kang
Bernhard Grill
This contribution discusses the merge of Sys2 SBR technology into RM0. Listening tests show
promise, but did not demonstrate improvement at the 95% level of significance. The SBR
technology merge will continue to be explored. In addition, aspects of TCX will be explored to see
if a technology merge can result in a gain in performance. ETRI anticipates reporting additional
information at the next meeting.
Several experts noted that the rules for the “technology merge” were not clear, nor were the specific
tools under consideration clear. It was the consensus of the AhG that the “technology merge” obey
the elements of the Core Experiment process.
Eunmi Oh, Samsung, presented
m16177
Progress report on unvoiced speech coding
Hosang Sung
Eunmi Oh
Eunmi Oh noted that the Sys4/RM0 “technology merge” work started in October, after the RM0
source code was available to MPEG. She also noted that in exploring the possibilities for
technology merge, the following were unexpectedly identified as candidates for the technology
merge:
 coding of phase in MPEG Surround
 coding of unvoiced speech segments along with variable bit rate encoding.
The contribution presented details on a technology merge/CE on Voiced/Unvoiced/Silence plus
Variable Bit Rate. It presents theoretical bitrate savings for the Voiced/Unvoiced/Silence detection
and coding mode switching. Samsung anticipates a workplan at this meeting and to report
additional information at the next meeting.
JungHoe Kim, Samsung, presented
m16173
Progress report on phase experiment for USAC
JungHoe Kim
Julien Robilliard
Eunmi Oh
Bernhard Grill
This contribution presented results for a technology merge/CE on enhancing the phase encoding in
the MPEG Surround tool for the purpose of coding stereo signals. Included are all elements required
for a CE proposal, including an overview of the technology and listening test showing performance.
Kristofer Kjörling, Dolby, noted that there are test items for which the MPEG-4 PS tool was not
able to sufficiently accurately estimating phase differences such that the proposed technology would
118
work. Both Dolby and Philips experts offered to do cross-check using additional test material. This
will be captured in a workplan and additional information is anticipated at the next meeting.
Max Neuendorf, FhG, presented
m16153
Proposed Corrections to WD and Reference Software on
Unified Speech and Audio Coding
Max Neuendorf
Philippe Gournay
Jérémie Lecomte
Markus Multrus
Stefan Bayer
Guillaume Fuchs
Ralf Geiger
Frederik Nagel
This contribution presented several categories of proposed fixed or changes to the WD text and
Reference Software. The categories are
 Editorial changes, which include:
o Incorrect fonts, formulas and references.
o Clarifications to ambiguous text.
o Obvious bugfixes in the Reference Software
o Changes to the text so that it is aligned to the Reference Software
 Bugfixes in software
o Incorrect handling of window transitions in decoder software and clarification of
associated text
o Incorrect time alignment of some parameters when core coder mode is switched
o Incorrect Harmonic SBR transitions
o Incorrect Harmonic SBR Stretching factor and clarification of associated text
o Incorrect Time-Warped SBR resampling buffer length. The presenter estimated that
this bug did not impact the CfP waveforms when they are quantized to 16-bit word
lengths.
o Extend MPEG Surround to lower sampling rates. This does not affect the CfP
waveforms as MPEG Surround was never active at the lower sampling rates.
o Other miscellaneous software bug fixes
 Proposal presented as Open Issues
o Encoder and Decoder block diagram
o Decoding of innovation sequence
The proposals were discussed as they were presented. The presenter noted that all software changes
except for those listed as “Open Issues” have already been incorporated into a revised version of the
software which can be found in the zip archive of the contribution.
Eunmi Oh, Samsung, requested a “sanity-check” listening test to verify that none of the reference
software changes have an impact on audio quality. Max Neuendorf will get back to the group
during the week with a proposal for such a listening test (i.e. at which operating modes will it be
evaluated).
It was the consensus of the AhG to adopt the Editorial and Bugfix changes (with the exception of
the window sequence signalling, which might conflict with another proposal in a contribution still
to be presented and in anticipation of a positive outcome of the “sanity-check” listening test).
The Open Issue items will be discussed during the MPEG week.
The Chair noted that contributions not presented in Sunday’s AhG meeting will be presented in the
Audio Subgroup.
Reference Encoder Software
Mohamad Raad, RaadTech Consulting, presented
m16044
Comment on the unified speech and audio coding activity
Mohamad Raad
The presenter read the AU NB comment. The NB comment asks for a common encoder/decoder
pair for use in the CE process. The Chair stated that, in his view, the request for a single encoder
code base for use in CEs might represent a viewpoint on which there is lack of consensus in the
Audio Subgroup.
119
Herve Taddei, Huawei Technologies, presented
m16079
Discussion on the Unified Speech and Audio Coding Activity Herve Taddei
Minjie Xie
Qing Zhang
The contribution noted that
 A reference encoder of high quality that is fully in source code would be of great benefit to
MPEG.
 A workplan should be maintained to organize the work of creating such a reference encoder.
This reference encode could be used as
 CE proponent shows merit of tool using the MPEG Reference Encoder
 The cross-check shows the merit of the tool using the Reference Quality Encoder
A successful CE obligated the proponent to integrate a (perhaps sub-optimal) version of the tool
into the MPEG Reference Encoder, and to provide evidence (e.g. listening test) that the quality of
the MPEG Reference Encoder is improved when the tool is incorporated.
The contribution included a listening test result showing that Huawei was able to, internally,
increase the quality of the current MPEG Reference Encoder at all bitrates. If, in Huawei’s view,
the Audio Subgroup consensus position on this matter is in line with Huawei’s view, then Huawei
would be willing to contribute the source code on which their listening tests are based.
Open issues are
 How would the Audio Subgroup determine that the MPEG Reference Encoder is of quality
“good enough” for use in meaningful CE work.
Kristofer Kjörling, Dolby, noted that when the MPEG Reference Encoder quality is quite low and
has many missing modules, then showing an improvement in the MPEG Reference Encode might
be quite difficult. He noted that a CE proponent might additional be obligated to provide
rudimentary versions of the missing modules. Markus Multrus, FhG, noted that many encoder
modules were inherited from the MPEG-4 Audio Reference Software such that if modules were
missing in the MPEG-4 Audio Reference software, they would be missing in the USAC Reference
Software.
Chair proposed break-out group to categorize MPEG Reference Encoder module into “signal flow”
and “control” and to investigate and verify what modules are present in the MPEG Reference
Encoder.
Werner Oomen, Philips, presented
m16110
Comment on the unified speech and audio coding activity
Werner Oomen
This contribution presents the following definitions:
 Informative Encoder – the source code available from ISO
 Reference Quality Encoder – the best quality encoder(s) within MPEG member companies
 Reference Encoder – the source code base proposed to be the “open source” project. (Note
that there may be reasons for NOT making this part of the source code available from ISO,
e.g. copyright and patent issues.)
The contribution observes
 MPEG CE methodology has produced outstanding technology that has been widely adopted
by industry.
 The CE methodology had evolved through numerous revisions
 There are typically three categories of CE
o Overwhelming clear winner
o Demonstrated clear merit
o Has ambiguous demonstration of merit
 A competitive CE process if beneficial
120
Concerning the Reference Encoder, Philips very much endorses creation of a high-quality
Reference Encoder, and if this Reference Encoder is also the Informative Encoder, then this speeds
adoption of the standard. However, they feel quite strongly that the Reference Encoder should not
be mandated for use in the CE process. Rather, a CE proponent should be free to use any encoder in
their CE proposal. However, it is quite likely that those with a Reference Quality Encoder will try
the CE tool in their Reference Quality Encoder code base to cross-check whether the tool provides a
comparable improvement.
The contribution provides figures that show example CE listening test results might drive CE results.
Mohamad Raad, RaadTech Consulting, asked what would occur if a CE was based on a low-quality
Reference Encoder, was accepted by the group, but the holder(s) of the Reference Quality Encoder
did not wish to incorporate the tool. The Chair responded that, first, the refusal to adopt the tool by
the Reference Quality Encoder proponent would in no way preclude the adoption of the CE tool.
But, second, it is the obligation of the Reference Quality Encoder proponent to incorporate the tool
and run a cross-check at the request of the Audio Subgroup.
Hyun-Kook Lee, LG, presented
m16118
Considerations on the development of common USAC
reference encoder
Dong Soo Kim
Sungyong Yoon
Hyun-Kook Lee
Jaehyun Lim
The contribution makes the following points:
 That there should be a common encoder for use in the CE process. This could be the
Reference Quality Encoder or a Reference Encoder that achieves comparable quality.
However, they stress the importance of a common encoder for all CEs.
 In order to create a Reference Encoder that becomes a Reference Quality Encoder,
requirements for development and a method for performance assessment must be agreed to.
 The results from successful CE should be incorporated into the Reference Encode.
The Chair noted that the current CE methodology does not mandate one and only one encoder for
use in CEs. Further, the Chair noted that the Audio Subgroup must balance potentially competing
objectives:
 that there be broad participation in the CE process, for example by the development of an
“open-source” Reference Encoder, although it may be that it does not have quality equal to
that of the best-known Reference Quality Encoder.
 That the standard uses the best known Reference Quality Encode in the CE process to insure
that only the best technology is adopted into the specification.
Kristofer Kjörling, Dolby, presented
m16140
Core Experiment procedures and MPEG reference software Kristofer Kjörling
encoder
Heiko Purnhagen
This contribution makes the following points
 MPEG has a long history of creating successful and widely adopted standards using the
existing CE methodology
 MPEG specifications leave the encoder as informative so that
o encoding performance can increase after the specification has issued
o different encoder tradeoffs (e.g. performance/complexity) are possible
 The MPEG process must use the best encoder available in the CE process in order to assure
that the highest quality specification is produced.
 Since the encoder is informative, MPEG cannot mandate that a proponent submit encoder
source code that may reveal their proprietary know-how.
The Chair noted that there may be some discussion on what level of informative source code must
be submitted. For example, if the decoder has as block-switching filterbank, then the encoder
121
should also have such a filterbank. The Chair proposed that it may be a role of the Audio Subgroup
to render an opinion as to whether the submitted code is “complete.”
Pierrick Philippe, Orange Labs, presented
m16143
Proposed improvements for MPEG Audio Core Experiment
Methodology and Reference Software Development
Pierrick Philippe
The contribution makes the following points:
 A poor quality Informative Encoder is useless for the purposes of standardization.
 It has the danger of being used by outside parties as representative of MPEG Reference
Quality Encoder
 A good quality Reference Encoder permits interested parties to quickly do a “trial” CE in
their own lab.
 A high quality Reference Encoder permits easier participation in the MPEG process
 A good quality Reference Encoder clearly need not be computationally efficient.
This contribution also proposes a way to collaboratively build a Reference MPEG Encoder.
It gives several examples of how performance results can be interpreted in the CE process.
It also comments on the current CE methodology:
 That the CE process use the CMOS test (e.g. 7-level, “A better than B”, A much better than
B”, …), and that the comparison RQ vs. AE+CE be made.
 That the CE uses a set of items that represent the three sets of categories: speech, speech and
music, and music. However, additions to the common set of items can be entertained for
specific tools.
There was considerable discussion on whether any conclusion could be drawn from a comparison of RQ vs AE+CE and
whether it is “good engineering practice” to compare one system against another system plus modification, in that one
cannot, in general, conclude that an observed difference in quality is due to the new tool as opposed to a difference
between the two encoders RQ and AE.
Kristofer Kjörling, Dolby, and Juergen Herre, FhG, noted that an MPEG Reference Quality
Encoder that was once labelled as “good” or “high” quality might be a disservice to MPEG years
later when commercial encoders far surpass the quality of the MPEG Reference Quality Encoder.
Roch Lefebvre, VoiceAge, presented
m16146
Comments on Core Experiments methodology for MPEG
USAC standardisation
Roch Lefebvre
Philippe Gournay
Redwan Salami
This contribution makes the following points
 A CE methodology exists, and the MPEG CE process is open to all experts, which is not the
case in some other standardization bodies.
 The CE process must balance
o Protection of RM0 and CE proponent encoder know-how.
o Making the CE process accessible to all MPEG experts.
Markus Multrus, FhG, presented
m16160
Thoughts on Core-Experiment Methodology
Bernhard Grill
Jürgen Herre
Ralf Geiger
Max Neuendorf
Markus Multrus
This contribution raised the following points:
 A Reference Quality Encoder should be used in CEs
 It is an unproductive diversion of resources to create an “open-source” Reference Encoder
and to maintain the code and verify its quality.
 The goal of CEs is to achieve a strictly monotonic improvement of the performance of the
specification. This can be assured if the CEs only use a Reference Quality Encoder.
 There need not be one and only one Reference Quality Encoder.
 The current CE process has worked well, leading to several widely adopted specifications.
122

The current CE process is still in force.
With regard to the Encoder Software
 The Reference Quality Encoder is available to all interested parties via e.g. a customdesigned object-code interface to RM0.
 The goal is to create a high-quality specification via continuously improving the normative
technology.
Imre Varga, Siemens, noted that it should not be the role of MPEG to mandate how the resources of
member companies should be spent.
Kei Kikuiri, NTT DoCoMo, presented
m16202
Comments on USAC Standardization Activities
Kei Kikuiri
Nobuhiko Naka
Kousuke Tsujino
This contribution makes the following points
 Common encoder used by CE proponents
 That the common encoder be of good to high quality
 Results of CE process be incorporated in the common encoder.
The Chair noted that contributions not presented in Sunday’s AhG meeting will be presented in the
Audio Subgroup.
4
Task group activities
4.1
Joint meetings
4.1.1
MPEG Surround Signalling and MP4 FF issues (with Systems)
Heiko Purnhagen, Dolby, presented
m16117
Thoughts on MPEG Surround signaling
Frans de Bont
Stefan Döhla
Heiko Purnhagen
Alexander Gröschel
The contribution presents a method for backwards-compatible signalling of MPEG Surround. David
Singer, Apple, noted that this “single-stream” MPEG Surround solution would also be compatible
with 3GPP File Format and IETF RTP streaming solutions. The presenter confirmed that there is
software that generates the bitstream and decodes the bitstream to produce an MPEG Surround
output. Furthermore the bitstreams with MPEG Surround signalling was fed to numerous AAC or
HE-AAC decoders and not “crashed” on the bistream and most played the core codec signal.
It was the consensus of Audio Subgroup to have a “Thoughts on MPEG Surround Signaling” output
document that NBs should consider when balloting.
Stefan Doehla, FhG, presented
m16054
Scalable Audio and MP4
Stefan Doehla
The contribution notes that a single-stream solution for SLS is not feasible for exact reconstruction,
as the SLS side information would overflow the AAC core input buffer constraints, and hence, a
two-stream solution is necessary.
However, multiple occurrences of “mp4a” tracks cause problems for many existing
implementations, especially when such tracks are not alternate encodings rather where one depends
on the other (e.g. one is a “base” AAC stream and the other is an SLS enhancement stream). Some
implementations only parse the first audio track encountered and ignore any additional tracks while
others do nothing even though they could decode the base track.
The contribution proposes an extension to the MP4FF specification that would support the notion of
“base” and “dependent” streams in the context of audio.
 mp4a is audio base track
123

m4ae is audio enhancement track
MP4FF supports a track reference structure (dpng) that might be used to infer base and extension
streams. Not all implementations reference this information, but they could. New audio codecs
could explicitly indicate that this should be done, e.g. in an informative section.
Systems proposed to start an amendment to 14496-14 (MP4 FF) to implement the “mp4ae” box
type.
Discussion of carriage of Audio Profile indication
Dave Singer noted that 14496-14/Cor1 already supports carriage of an Audio Profile indication
within the ES_Descriptor.
4.1.2
High-Performance Video Coding (HVC) and Audio (with Requirements)
The MPEG Video Subgroup is considering video coding for devices whose capabilities are beyond
those of current HD. These might be Ultra-HD (UHD) devices, such as 4K x 2K displays. We note
that for such UHD displays, a much closer viewing distance is feasible and perhaps desirable. This
could be envisioned as a “personal UHD” experience in which there is both visual and audio
envelopment. This could have significant impact on audio presentation such that accurate sound
localization can occur. If there is truly only one viewer, it might be that aspects of the audio
presentation might be individualized in some meaningful way.
Audio experts can track documents in the “HVC” work item of the WG11 resolutions.
4.2
Task Group discussions
4.2.1
MPEG-2, MPEG-4, MPEG-7 Audio and MPEG Surround, conformance, reference
software
Tilman Liebchen, LG, presented
m16036
Proposed Text of ISO/IEC 144963:2005/Amd.2:2006/DCOR4
Noboru Harada
Tilman Liebchen
Takehiro Moriya
Yutaka Kamamoto
This contribution is candidate corrigendum text that:
 corrects syntax of pseudo-code
 clarify mathematical operations in pseudo-code such that they are never ambiguous
 clarify use of terms (i.e. layer vs. stage)
Kristofer Kjörling, Dolby, presented
m16056
Proposed Draft Corrigendum on AAC-ELD
Markus Schnell
Per Ekstrand
This contribution is candidate DCOR text that
 preserves text describing how an implementer can alter the complex-exponential phaseshifts in the Complex QMF low delay filter bank based on implementation goals
 provides PCU and RCU complexity figures
Pierrick Phillipe, Orange Labs, asked to clarify the definition of PCU and RCU and how they were
derived. The Chair noted that, by his understanding, 1 PCU is defined as 1 Million signal
processing operations per second (e.g. multiply-accumulate is one operation) and 1 RCU is defined
as 1 K-Word of storage, where a “Word” is the storage required for an arithmetic value.
Kristofer Kjörling
m16141
Proposal for splitting the current AAC family profiles into two Kristofer Kjörling
Andreas Schneider
This topic has been raised at previous MPEG meetings. This contribution reviews the timeline and
history of the issue and the status of support for 960 transform length with respect to reference
software and conformance. As of the date of this MPEG meeting, all conformance sequences for
960 block length are available and that the reference software (mp4mcDec) has been extended to
support 960 block length. It notes that most implementations that use the AAC or HE-AAC v2
124
profile use and support exclusively 1024 block length and only DAB+ uses HE-AAC V2 with 960
block length. DRM uses HE-AAC v2, but uses a ER Scalable AAC core.
The contribution proposes the following changes to the profiles:
 AAC profile, HE-AAC profile and HE-AAC v2 profile be restricted to 1024 block length.
 HE-AAC v2 960 profile be created, and that it be restricted to 960 block length.
There was considerable discussion concerning the impact of the proposed change on current
licensing programs for the AAC family of technology.
David Singer, Apple, recommended that the MP4FF be extended to signal the audio profile.
Bernhard Grill, FhG, suggested that there be a break-out to discuss the future direction of audio
profiles, e.g. in what market applications 1024 or 960 should be supported. The Chair scheduled the
break-out for 2PM Tuesday, after which this discussion will be resumed.
Ralf Geiger, FhG, presented
m16127
Proposed Corrigendum on MPEG-4 SLS Conformance
Ralf Geiger
The contribution proposes that conformance information be updated to
 fix the truncated bitstream bug present in the arithmetic coding tool
 have a consistent delay within the SLS conformance
It was the consensus of the ASG to issue this as a DCOR at this meeting.
The Chair noted that there is a Systems/MP4FF mechanism to eliminate the decoder “state-variable
zero output block” and urged that the USAC decoder software support this mechanism.
Junghoe Kim
M16200
Proposed BSAC Conformance Bitstreams for Terrestrial
DMB
Miyoung Kim
Junghoe Kim
Eunmi Oh
The contribution proposes a definition for conformance streams that specifically support the BSAC
configuration used in the Terrestrial DMB (T-DMB) ETSI specification.
It is the consensus of the ASG to incorporate these definitions and conformance streams into
MPEG-4 Audio Conformance.
On behalf of Matthias Gruhne, FhG, the Audio Chair presented
m16108
Study on ISO/IEC TR 15938-8:2002/FPDAM 4
Matthias Gruhne
The contribution supplies extensive additional information concerning how STFFT coefficients are
derived from the MDCT coefficients in the audio coded representation. This includes addition
explanatory text and Matlab and C source code that provides an example implementation.
It was the consensus of the ASG to issue this as ISO/IEC TR 15938-8:2002/DAM 4.
Heiko Purnhagen, Dolby, presented
m16121
Further corrections to MPEG Surround text
Heiko Purnhagen
Jeroen Koppens
Matthias Neusinger
This contribution proposed various editorial corrections and clarifications.
It was the consensus of the ASG to issue this as a DCOR to MPEG Surround
Heiko Purnhagen, Dolby, presented
m16124
Corrections to MPEG Surround reference software
Heiko Purnhagen
Jeroen Koppens
Claus-Christian Spenger
Matthias Neusinger
It was the consensus of the ASG to issue this as a DCOR to MPEG Surround Reference Software
4.2.2
MPEG-D Spatial Audio Object Coding
Leonid Terentiev, FhG, presented
m16099
Report on corrections for the MPEG SAOC FCD text and
RM software
125
Jonas Engdegård
Heiko Purnhagen
Cornelia Falch
Leonid Terentiev
Andreas Hölzer
Oliver Hellmuth
Johannes Hilpert
Yang-Won Jung
Henney Oh
Jeroen Koppens
This contribution presents
 Editorial changes to the SAOC text consisting of correcting formatting and equation style,
spelling and abbreviation errors. In addition, mathematical expressions were corrected,
clarified and simplified.
 Bugfixes to the SAOC reference software.
It was the consensus of the ASG to incorporate the proposed changes in to a Study on the FCD.
Heiko Purnhagen, Dolby, presented
m16095
Information regarding CE on Low Delay MPEG SAOC
Jonas Engdegård
Heiko Purnhagen
Oliver Hellmuth
Johannes Hilpert
Maria Luis Valero
Andreas Hölzer
Markus Schnell
Leonid Terentiev
Erik Schuijers
Per Ekstrand
The contribution notes that adopting the AAC-ELD low-delay filterbank resulted in audible
artefacts in a number of identified critical items. Hence it proposes to use a filterbank with slightly
larger delay (from 1.3 ms to 5.3 ms), in which case artefacts are no longer audible. However this
new filterbank reduces the SAOC system delay from 26.7 ms to 5.3 ms (i.e. assuming a zero-delay
core “coder”). Various other changes are proposed in order to optimize the low delay operating
mode (parameter band grouping, decorrelators, and bit that signals low delay mode).
The proposed low-delay SAOC system achieved the following delay:
Core Coder
Total one-way system delay
AAC LD
26.6 ms
AAC ELD
21.3 ms
AAC ELD with SBR 39.0 ms
Listening test results were shown for the AAC ELD core coder using an operating mode that results
in a 21.3 ms one-way delay (as in the table above). Generally, the results showed that the low delay
mode provides audio quality that is comparable to the regular mode, in the mean and for all
individual items.
It was the consensus of the Audio Subgroup to incorporate this technology into the FCD text via a
“Study on ISO/IEC FDIS 23003-2:200x, Spatial Audio Object Coding.” The Chair noted that the
FCD ballot closes on 2009-03-14. Hence the final low delay filterbank coefficients and a “sanity
check” listening test on that final filterbank must be available sufficiently in advance of that date so
that National Bodies can take that information into account when casting their ballots. It was agreed
that the following information will be available prior to the close of the ballot
 Frequency response of interim and final prototype filters
 “Sanity check” listening test, e.g. 8 listeners for most critical items
This will be documented in the workplan for SAOC.
Finally, it was noted that the SAOC low-delay filterbank is not compatible with MPEG Surround,
however there was no reason to expect that MPEG Surround could not be extended, if desired, to
have a low-delay mode using the exactly the SAOC low-delay filterbank.
Pierrick Philippe, Orange Labs, presented
m16216
Subjective Evaluation of Low Delay MPEG SAOC
P. Philippe
This contribution presented listening test information that was very similar to the test results
provided in the previous contribution (m16095).
126
Heiko Purnhagen, Dolby, presented
m16096
Information regarding CE on Low Power MPEG SAOC
Jonas Engdegård
Heiko Purnhagen
Oliver Hellmuth
Leonid Terentiev
Erik Schuijers
This contribution provides addition information on the CE on low power SAOC. The final
configuration of the technology was clarified and a comprehensive set of listening test results
provided. Relative to the “high-quality” mode, the low power mode most prominently has
 A mixed real-valued/complex-valued filterbank (first 8 bands are complex), as is used in
low-power MPEG Surround.
 Low-complexity decorrelator for mono downmix to stereo output, as is used in low-power
MPEG Surround.
 No decorrelator for stereo downmix to stereo output.
 Reduced-bandwidth residual, as is used in low-power MPEG Surround.
 A 50 % reduction in computational complexity with respect to “normal” SAOC.
Overall, the performance of the low-power mode demonstrated a slightly lower mean performance,
but not at the 95% level of significance.
It was noted that the “reduced-bandwidth residual” was not tested for the advanced karaoke
application. A Workplan will coordinate the tasks for FhG to create test waveforms and LG to
conducting a listening test.
Interoperability and naming of the various “flavors” of SAOC is summarized in the following table,
where an “X” indicates that a given bitstream can be decoded by a given decoder. Note that HighQuality and Low-Power bitstreams are identical.
Decoder
Bitstream HQ LP LD
HQ/LP
X
X
LD
X
Kwang-Ki Kim, ETRI, presented
m16084
CE on Residual Coding Process for Post Downmix Gain
Kwang-Ki Kim
Jeongil Seo
Seungkwon Beack
Kyeongok Kang
MinsooHahn
This contribution provides listening test data that shows that a residual signal with a bandwidth
equal to the first 2 QMF bands improves audio quality at the 95% level of significance in this
application area. The contribution also reports on a cross-check listening test done at LG.
The Chair noted that the SAOC processing will remove the mastering “feel” even before any sound
stage re-mixing is applied, so that the user will experience a very different feel as compared to the
stereo mastered mix. There was considerable discussion as to what the user experience would be
when using the proposed technology and whether the proposed technology would bring the
expected value to the consumer experience. There was further discussion as to the increase in
complexity, if any, that the proposed technology would bring. The Chair suggested that ETRI bring
additional information later in the week that would clearly indicate the evolution of technology and
performance over the course of the CE, and this will be discussed later in the week.
Zhong Haishan, Panasonic, presented
m16086
Efficient inter-object relation indicator for SAOC
Zhong Haishan
Zhou Huan
Chong Kok Seng
Tomokazu Ishikawa
Takeshi Norimatsu
The contribution proposes a syntax that supports both a “flexible” and an “efficient” way of
indicating inter-object relation. Leondid Terentiev, FhG, noted that the contribution does not take
127
into account the fact that the Config structure is always padded out to an integer number of bits (i.e.
via byte_align()), which may significantly change the bit savings count. Heiko Purnhagen, Dolby,
noted that it is informative to measure bit savings WRT an entire bitstream under the assumption of
a single Config (i.e. file-based decoding) or periodic Config (i.e. transmission-based decoding
supporting break-in).
Panasonic experts will bring additional information later in the week that addresses the bytealignment issue and the periodic transmission issue.
Leonid Terentiev, FhG, presented
m16097
Information regarding mixing mode for the enhanced
Karaoke/Solo processing
Cornelia Falch
Leonid Terentiev
Johannes Hilpert
Oliver Hellmuth
The contribution proposes an extension to the Karaoke/Solo mode that allows for more efficient
processing in the case that an arbitrary mix is used for the Karaoke/Solo separation.
It was the consensus of Audio Subgroup to include this technology into the Study on SAOC FCD
text.
Leonid Terentiev, FhG, presented
m16098
Proposal for MCU functionality extension for the MPEG
SAOC
Leonid Terentiev
Cornelia Falch
Oliver Hellmuth
This contribution provides test clarifying
 MCU functionalities
 MCU operations (i.e. as mathematical equations)
 MCU control interface
as was requested at the 86th MPEG meeting.
It was the consensus of the Audio Subgroup to incorporate this additional information into the
Study on SAOC FCD text.
Yang-Won Jung, LG, presented
m16100
Proposal for dynamic preset extension for the MPEG SAOC Heiko Purnhagen
Cornelia Falch
Leonid Terentiev
Oliver Hellmuth
Johannes Hilpert
Yang-Won Jung
Henney Oh
Jeroen Koppens
This contribution proposed dynamic presets. Since presents are uniquely identified by a label string,
it is proposed that when a decoder receives a preset with an already known label, it overwrites the
currently stored preset settings with the new information.
Heiko Purnhagen, Dolby, noted that there are a few details still to be specified in order to have a
rational and deterministic system. He will lead a break-out group to study these issues and report
back to the group.
It was the consensus of Audio Subgroup to adopt this technology into the Study on SAOC FCD text,
subject to a positive report from Heiko Purnhagen.
Yang-Won Jung, LG, presented
m16103
Consideration on User Interface in SAOC
Yang-Won Jung
Henney Oh
This contribution describes several possible user interface controls that an SAOC decoder might use
in different application scenarios. There was considerable discussion, in which it was pointed out
that MPEG might offer tutorial examples of how to determine a rendering matrix, i.e. as
mathematical expressions. LG will draft candidate informative text for review later in the week.
Yang-Won Jung, LG, presented
m16104
Proposal for adding information on object characteristics in
SAOC
128
Yang-Won Jung
Henney Oh
The contribution noted that it may be valuable for the decoder to have some knowledge of the
nature of the objects contained in the downmix signal. It proposes to add syntax constructs in the
Config() structure to carry such information.
It was noted that the proposal does not seem to have a computational-based algorithm to deduce the
information and has no normative decoding aspect.
The Chair suggested that this matter requires more discussion in a break-out group, and asked that
group to report after lunch Thursday.
Yang-Won Jung, LG, presented
m16105
Proposal for including guideline information on the rendering Yang-Won Jung
parameters in SAOC
Henney Oh
The contribution proposes an informative method to influence, or even limit, the user’s choice of
rendering matrix. Informative guide would take the form of mathematical expressions.
The Chair noted that a possible mechanism to support this need and that expressed in the previous
contribution is to define a User XML Data box that could contain arbitrary information, perhaps
based on a normative set of tags.
The Chair suggested that this matter requires more discussion in a break-out group, and asked that
group to report after lunch Thursday.
Yang-Won Jung, LG, presented
m16106
Comments on the enhanced karaoke mode in SAOC
Yang-Won Jung
Henney Oh
This contribution notes that in “normal” SAOC mode it is possible to apply a small amount of
control to every object via a small rate of additional side information per object. In order to achieve
Karaoke/Solo function, a residual signal is required for “foreground” and “background” objects.
The contribution proposes to
 unify the normal mode and EKS mode
 transmit information in the bitsream that indicates which object is matched to which residual
signal.
This proposal was discussed, particularly the validity of the assumptions taken by the contribution
It was clarified that ENG mode and residuals typically are used together, with residual being bandlimited (e.g. first 8 bands) and ENG being used in the higher bands (which there is no residual
signal).
The Chair suggested that this matter requires more discussion in a break-out group, and asked that
group to report after lunch Thursday.
Pierrick Philippe, Orange Labs, presented
m16107
Proposed Audio Sequences for MPEG-D SAOC
Pierrick Philippe
Gregory Pallone
Marc Emerit
The contribution reports that Orange Labs is making available to MPEG, for the purpose of
developing MPEG standards, audio signals that are appropriate for assessing performance of SAOC
in a teleconferencing scenario. These include signals with
 up to 5 talkers, each recorded individually in an acoustically isolated environment
 background music
 talkers are involved in an actual task, which might optionally involve the background music
item (e.g. interactively guess the song title, or conduct a technical meeting).
Items are available on a USB stick (some 80 Mbytes).
Break-Out on SAOC (Oliver Hellmuth)
These contributions have already been presented, and addition discussion occurred in the break-out
group, and the recommendations of the break-out are reported here:
m16084, ETRI, CE on Residual Coding Process for Post Downmix Gain
 Value not clear. Proposal: no action
129
m16103, LG, Consideration on user interface in SAOC
 Proposal: Put guideline into informative annex
m16104, LG, Proposal for adding information on object characteristics in SAOC
 Proposal: Use existing BsRelatedTo + additional informative text (to give an idea of the
functionality)
m16105l, LG, Proposal for including guideline information on the rendering parameters in SAOC
 Problem understood, but not solved with the proposed solution. Proposal: no action
m16106, LG, Comments on the enhanced karaoke mode in SAOC
 Proposal: FhG to provide input to the next meeting: Clarification of the "Study On FCD"
text based on questions raised in this input contribution
m16086, Panasonic, Proposal on efficient inter-object relation indicator for SAOC
 Minimal saving relative to overall bit rate. Proposal: no action
m16100, Dolby, FhG, Philips, Proposal for dynamic preset extension for the MPEG SAOC
 Proposal: Include necessary clarifications of the input contribution and the related
discussion in the "Study On FCD" text within the editing period
4.2.3
MPEG-D Unified Speech and Audio
USAC Core Experiments
Kristofer Kjörling, Dolby, presented
m16142
Core experiment proposal on the USAC eSBR module
Lars Villemoes
Per Ekstrand
Kristofer Kjörling
The contribution notes that there is audible pre-echo when using the SBR tool at 16 kb/s for some
mono signals, e.g. “castanets.” It gives an overview of the harmonic transposition algorithm as used
in USAC SBR, and notes that when the signals are pulse-like rather than sinusoidal and harmonic,
care must be taken when doing the time-frequency transforms that are part of constructing the
harmonic extension. Spectrograms where presented that showed the presence of pre-echo in the
current processing mode and their absence in the proposed mode.
Listening test results were presented for an extended test set (adding castanets and harpsichord).
Results suggest that the technology has merit, but may have a significant increase in complexity,
with the complexity increase depending on how one arranges the alignment of windows and
perhaps other factors.
Dolby experts will work with the RM0 proponents to refine this proposal and bring additional
information to the next MPEG meeting.
Philippe Gournay, VoiceAge, presented
m16147
Proposed Core Experiment on LPC Quantization for USAC
Philippe Gournay
Bruno Bessette
Roch Lefebvre
Redwan Salami
The contribution reviewed the RM0 LPC vector quantizer, and proposes an alternative solution
which has significantly lower table storage requirements. In addition, the proposed solution has a
more flexible bit allocation scheme that permits using more bits when needed such that there are
fewer quantized LPC models large spectral distortion (relative to the unquantized model).
Listening test results were presented at the 16 kb/s mono operating mode, showing that the
proposed system had quality that was not different from the RM0 system.
130
It was suggested that additional information be provided to aid the Audio Subgroup in making a
decision. This could be:
 cross-check listening test at 16 kb/s for mono signals
 proponent and cross-check listening test at some additional operating mode(s)
 listening test methodology might be MUSHRA and also CMOS
 spectral distortion information at all operating points
Markus Multrus, FhG, presented
m16162
Proposed Update of Arithmetic Coder Tables for USAC
Guillaume Fuchs
Markus Multrus
The contribution reviews the current context-adaptive arithmetic coder, which requires 105K words
of storage (75% of total table storage in decoder). It proposes to replace the existing tables with new
tables whose total size is 15 K words, which is a reduction by a factor of 7. This is achieved by
reducing the number of probability distributions from 256 to 32.
It is possible to create an “encoder” via a lossless transcoding of the RM0 bitstreams to the
proposed bitstream format and to decode these by the proposed decoder. It was verified that both
decoders outputs were bit-identical waveforms. When averaged over all test items and all operating
points, the new arithmetic coding scheme had the same bitrate (identical to 2 decimal digits). It was
further noted that computational complexity is comparable.
Bernhard Grill, FhG, noted that the RM0 arithmetic coding design was crude and preliminary, and
this proposal is based on a careful analysis of the problem and subsequent optimization.
Samsung offered to conduct a cross-check of the decoded waveforms and the bitrate changes as
presented in the contribution.
The Chair suggested that, if that cross-check is available one week after the close of the MPEG
meeting, and if it is positive, then the ASG could agree at this meeting to incorporate the proposed
technology into the USAC WD and RM.
Audio experts will draft a Workplan for USAC CEs that coordinates work for the following CEs:
Company CE
Samsung Phase in stereo coding
Samsung Additional mode of unvoiced speech coding
ETRI
SBR
ETRI
TCX
Dolby
SBR harmonic extension
VoiceAge LPC VQ
FhG
Arithmetic coding tables
MPEG Encoder and the CE process
This was a continued discussion of how the USAC source code encoder could be used in the CE
process. Mohamad Raad, RaadTech Consulting, made a presentation on this topic.
The proposal emphasises the following point:
 An additional component of the CE integration phase, the proponent must provide sufficient
support, as educational text and/or exemplary source code so as to enable others to
implement the tool in the MPEG Reference Encoder. The “support” component must be
accepted by the consensus of the Audio Subgroup.
In addition, the proposal requests that the RM0 proponents enhance the current Informative Encoder
source code such that there is code for all modules in the encoder block diagram in the informative
part of the WD.
The ASG agrees with the following plan of action:
Draft Workplan to
 define all modules
 categorize modules as signal flow or control
131


identify what modules are present or missing in the Informative Encoder source
evaluate the informative encoder description in the WD and judge if it provides sufficient
“support”
Update CE methodology to incorporate
 An additional component of the CE integration phase, the proponent must provide sufficient
support, as educational text and/or exemplary source code so as to enable others to
implement the tool in the MPEG Reference Encoder. The “support” component must be
accepted by the consensus of the Audio Subgroup.
The Chair requests that RM0 proponents submit contribution that specifies the areas, if any, that
they could provide missing or improved source code.
Junghoe Kim, Samsung, presented
m16055
Comments on WD of Unified Speech and Audio Coding
Kihyun Choo
Junghoe Kim
Eunmi Oh
This contribution suggested
 editorial corrections and clarifications
 alignment of text to ref sw for noise filling tool
 minor technical changes to “acelp_core_mode” signalling that slightly reduce bitrate
It further notes that the RM0 CfP bitstreams have undefined bits at the end of
usac_raw_data_block(). This appears to be an extension payload element.
It is the consensus of the ASG to incorporate the following into the USAC WD text:
 editorial corrections and clarifications
 alignment of text to ref sw for noise filling tool
Hyun-Kook Lee, LG, presented
m16119
Proposed syntax revision on USAC RM0
Dong Soo Kim
Sungyong Yoon
Hyun-Kook Lee
Jaehyun Lim
This contribution suggested
 minor technical changes to “acelp_core_mode” signalling that slightly reduce bitrate
It is the consensus of the ASG to incorporate the comments from Samsung and LG on
“acelp_core_mode” signalling into a document titled “Thoughts on Efficient Bitstream Syntax.”
The ASG will consider the syntax change proposals in this document no later than the meeting at
which USAC progresses to CD stage. This does not imply that the ASG will take any action on the
proposals, but rather than each will be considered on the basis of its technical merit.
USAC CE Break-out (Markus Multrus)
Hyun-Kook Lee, LG, continued his presentation of
m16119
Proposed syntax revision on USAC RM0
Dong Soo Kim
Sungyong Yoon
Hyun-Kook Lee
Jaehyun Lim
The main points of this contribution were:
 1st Issue on acelp_core_mode and lpd_mode already talked through before the break-out
 2nd issue on the transmission of noise_offset parameter in case noise_level equals 0 (in
fd_channel_streams)
o In case noise_level == 0, noise_offset is not used
o Max Neuendorf, FhG proposed, to collect this proposal in the "Thoughts on Efficient
Bitstream Syntax" document. There was no agreement on this so the decision was
postponed.
132
o Roch Lefebvre, VoiceAge, suggested that the group concentrate on other CEs that
have a more significant impact on the performance of the work item.
It was the consensus of the Audio Subgroup that this technology would be included in the
“Thoughts on Efficient Bitstream Syntax” document.
Hyun-Kook Lee, LG, presented
m16122
Efficient signaling for FD frame on USAC RM0
Dong Soo Kim
Sungyong Yoon
Hyun-Kook Lee
Jaehyun Lim
The main points of this contribution were:
 Proposal to apply differential coding for global_gain, max_sfb. Difference should be
Huffman-coded
 Syntax revision in single_channel_elemnt(), channel_pair_element(), ics_info()
 Differential coding could be turned on/off. This is signalled by an additional bit in the
bitstream
 Bitrate savings for 2 scenarios were presented: Storage scenario (up 0.81% saving),
streaming scenario (up to 0.78%)
 Ralf Geiger pointed out that this proposal infringes the random-access of AUs: parsing of
syntax is infringed
 Bernhard Grill pointed out the importance of the correct global_gain for decoding and level
adjustment: increament/decreament by 1 issues a level difference for 1.5 dB
 Ralf Geiger also pointed out that the correct decoding of max_sfb is important for paring of
the frame
 Tilman Liebchen pointed out the bitrate savings.
 Werner Oomen proposed to include this issue into the "Thoughts on Efficient Bitstream
Syntax" document
 Max Neuendorf repeated the importance of the self-containedness for the bitstream parsing
 There was no consensus to put issue into the "Thoughts on Efficient Bitstream Syntax"
document
 Advantages and disadvantages were listed by the presenter:
o Advantage: In average bitrate saving
o Disadvantge: Break up self-containedness (ER)
o Disadvantage: Bitrate saving not guaranteed
o For Random-Access: Even bitrate increment
o Disadvantage: Loss of functionality (level adjustment)
It was the consensus of the Audio Subgroup that, for now, this technology would not be included in
the “Thoughts on Efficient Bitstream Syntax” document.
Hyun-Kook Lee, LG, presented
m16125
Proposed syntax revision regarding window sequence on
USAC RM0
Dong Soo Kim
Sungyong Yoon
Hyun-Kook Lee
Jaehyun Lim
The main points of this contribution were:
 Proposal to entropy code window_sequence for USAC
 In RM0: fixed 2 bits
 In proposal: 1..2 bits
 Conflicts with m16153, m16154
 Bitsavings 0.060% … 0.099% for all FD frames, 0.032%..0.067% in total
 Proposal in case the proposed corrections to WD are applied, proposal not possible any
more (more than 3 possibilities): 4 transitions present in bitstreams
 Conflicts with m16154
133

Problems with Random-Access: if last frame is lost, not clear if long/short frame: Frame not
parsable
It was the consensus of the Audio Subgroup that, for now, this technology would not be included in
the “Thoughts on Efficient Bitstream Syntax” document.
Hyun-Kook Lee, LG, presented
m16123
Comment on random access issue on USAC RM0
Dong Soo Kim
Sungyong Yoon
Hyun-Kook Lee
Jaehyun Lim
The main points of this contribution were:
 In current RM0: Frame depends on previous frame
 Correct Decoding of frame not possible in case last frame is lost
 Proposal:
include
last_core_mode,
last_window_sequence
in
header
(USACSpecificConfig())
 Werner Oomen pointed out that decoding of 1st frame is not completely possible , because
e.g. filterbank states are missing
 Problem: Start-up only in case header is received
 Previous frame needed to decode current frame
 Take discussion offline
Ralf Geiger, FhG, presented
m16154
Proposed Update on USAC Bitstream Syntax
Jérémie Lecomte
Max Neuendorf
Ralf Geiger
Markus Multrus
The main points of this contribution were:
 Alternative signalling for window_sequence, which simplifies, saves bitrates
 In RM0: window_sequence coded by 2 bits, info from previous frame needed to determine
correct seuqnece
 Proposed sequence: 1..2 bits, indicates transform length, right window slope length; length
of left window slope derived from previous window, incorporated in ics_info()
 Proposed update: No restrictions to window signalling as in RM0
 Bitrate reduction between 4.41 … 22.41 bit/second
 Additional presentation on "Requirements for window signaling":
o Random access: Bitstream must be parsable, right window half must be preserved
o Efficient syntax with low redundancy
 Comparison of RM0, m16123, m16125, m16154 wrt random-access: a) bitstream syntax
self contained, b) random-access, c) perfect reconstruction, d) efficient syntax, easy to
understand
o RM0: a), b)
o M16123: a), b) (b with restrictions)
o M16125: c)
o M16154: a), b), c), d)
 Discussion on need of this proposal: Kristofer Kjörling pointed out the agreement not to go
after maximum bitrate saving on current syntax; Max Neuendorf explained the proposed
scheme as new concept of window signalling: Treat every proposal equal
 Take discussion offline
After offline discussion between Ralf Geiger, FhG, and Hyun-Kook Lee, LG, it was agreed to include FhG’s m16154
proposal rather than LG's m16123 proposal in the “Thoughts on Efficient Bitstream Syntax” document. This was
approved by the Audio Subgroup upon presentation of the “Thoughts on Efficient Bitstream Syntax” document in the
Friday Audio Plenary.
Hyun-Kook Lee, LG, presented
134
m16120
Proposed syntax revision regarding SBR bitstream on
USAC RM0
Dong Soo Kim
Sungyong Yoon
Hyun-Kook Lee
Jaehyun Lim
The main points of this contribution were:
 Propose to reduce redundancy in bs_frame_class, bs_var_bord
 In RM0: bs_frame_class: 2bits
 Proposal 1: replace it by 1 bit, dependent on frame_class of previous frame
 Proposal 2: bs_var_bord_0: same as bs_var_bord_1 of previous frame (indicates the borders
of SBR framing), could so omitted
 transmit both values "self-containing" in case of (frame_refresh != 0); frame_refersh is set in
case of header transmission
 Discussion by Kristofer Kjörling: Not all frame class combinations can be signalled by
proposal 1, e.g. not VARVAR succeeding FIXFIX, which are valid in SBR, proposal 2
prevents check for transmission errors on decoder side
 No further actions on this. There was a proposal that Kristofer Kjörling should come up with
a tutorial on SBR frame class transitions.
Reference Encoder Software (continued from AhG meeting)
Mohamad Raad, RaadTech Consulting, presented
M16208
Proposal for the development of a common MPEG Audio
encoder for use in the CE phase
Mohamad Raad
The contribution notes that there should be a common USAC encoder that will be used for the CE
process. To this end, it presents a detailed process on how to organize and manage the project of
creating such software.
Justin Ridge, Nokia, presented
m16166
On the Unified Speech and Audio Coding Activity
Mauri Väänänen
This contribution made the following points:
 That the USAC work item should have the goal of significant improvement of performance,
and that improvement of worst-case performance is most important (i.e. so that consistent
performance is achieved).
 That complexity is important (i.e. so that implementations for battery-powered devices are
possible)
 That it is more important to achieve significant improvement of performance than achieve
quick time to market.
It further notes that a high-quality standard is more assured if
 There is a low threshold of participation
 There is a high threshold of acceptance (for tools proposed in CEs)
The contribution presents an example process for how the USAC CE process might proceed using
both proprietary Reference Quality Encoder and “open-source” Reference Encoder.
The Chair welcomed the opinions of Nokia as a major device manufacturer, and urged audio
experts to “aim high” in order to be successful in the marketplace.
4.2.4
Exploration: Meta-Data
Stephan Schreiner, FhG, presented
m16144
Perspectives on Application Scenarios for Post-Processing
Audio Metadata
Stephan Schreiner
Wolfgang Fiesel
Akshaya Thippur
This contribution presents possible application scenarios for the use of metadata in the context of
audio post-processing. It noted that, historically, there may have been a well-defined and wellcontrolled closed environment in which a content producer could exactly predict and control the
quality of the user experience. As content production, program assembly for transmission and user
135
listening environments becomes much more dynamic and heterogeneous, the assumption that a
content producer can predict and precisely control the quality of the user experience is no longer
true.
It specifically discussed
 “Comfort Zone” adjustment, in which the dynamic range of an audio program is controlled
and adjusted
 “Clean Audio” enhancement, in which the dialog component of an audio program can be
boosted relative to the remaining program audio elements.
It proposes that there is now the possibility to serve the discussed applications using a unique and
efficient manner based on the concept of audio objects and associated parametric representations.
Bernhard Grill, FhG, suggested that a liaison statement to DVB groups that could solicit their vision
for future-looking system functionality that might be enabled by audio-related metadata. The Chair
suggested that there should be an AhG mandate to craft candidate liaison text and/or “vision”
document that could be attached to liaison statement.
Kristofer Kjörling, Dolby, noted that often the dialog is often not available as a distinct program
component, and so computing the dialog metadata can be a difficult problem. Bernhard Grill, FhG,
noted that the MPEG experts should think about the technology that could be used to support the
envisoned funcitonality
5
Audio closing plenary discussions
Max Neuendorf, FhG, presented additional information on proposed changed to the USAC WD
 TW- MDCT
o Segmental SNR plots show that change does not affect a 16-bit reconstruction of the
output waveform
 For all proposed changes taken together
o SNR and SegSNR show reasonable performance
o Listening tests show no difference between RM with and without the proposed
changes
 Additional proposed changes to the mathematical formulas were proposed
 Add additional details to Figure 1.1, Encoder Block Diagram.
The Chair requested that the new Figure 1.1 be posted to the USAC email reflector and that it be
used in the Workplan for developing the MPEG Reference Encoder.
It was the consensus of the Audio Subgroup to incorporate these proposed changes into USAC
WD2.
6
6.1
Meeting deliverables
Responses to Liaison and NB comments
The responses to Liaison and NB comments were prepared and approved.
6.2
Recommendations for final plenary
The Audio recommendations were presented and approved.
6.3
Establishment of Ad-hoc Groups
The following ad-hoc groups were established by the Audio subgroup:
No.
Title
AHG on Audio Standards Maintenance
AHG on Unified Speech and Audio Coding and Spatial Audio
Object Coding
136
Mtg
No
Yes
6.4
Approval of output documents
All output documents, shown in Annex D, were presented in Audio plenary and were approved.
6.5
Press statement
The Audio contribution to the press statement was presented. Editing and further review will be done via email.
7
7.1
Future activities
Schedule of future meetings
Ad Hoc group meetings are indicated in Section 6.3. Unless otherwise indicated, Ad Hoc group
meetings will be held at the location of the next MPEG meeting on the weekend preceding that
meeting.
7.2
Agenda for next meeting
The agenda for the next MPEG meeting is shown in Annex F.
7.3
All other business
There was none.
7.4
The 87th
Closing of the meeting
Audio Subgroup meeting was adjourned Friday at 13:45.
137
Annex A Participants
First Name
Last Name
Country
Affiliation
Johannes
Boehm
DE
Thomson
Ti Eu
Chan
SG
I2R
Yujie
Dun
CN
XJTU
Ralf
Geiger
DE
Fraunhofer IIS
Philippe
Gournay
Canada
Bernhard
Grill
DE
VoiceAge Corp. / Univ. of
Sherbrooke
Fraunhofer IIS
Oliver
Hellmuth
DE
Fraunhofer IIS
Jürgen
Herre
DE
Fraunhofer IIS
Jeff
Huang
USA
Qualcomm Inc.
Yang-Won
Jung
KR
LG Electronics
Kyeong Ok
Kang
Korea
ETRI
Florian
Keiler
Germany
Thomson
Kei
Kikuiri
JP
NTT DOCOMO
Junghoe
Kim
KR
Samsung AIT
Kwangki
Kim
KR
Kristofer
Kjörling
SE
Information and Communications
Univ.
Dolby
Hyunkook
Lee
KR
LG electronics
Taejin
Lee
KR
ETRI
Roch
Lefebvre
Canada
Tilman
Liebchen
DE
VoiceAge Corp. / Univ. of
Sherbrooke
LG Electronics
Takehiro
Moriya
JP
NTT
Markus
Multrus
DE
Fraunhofer IIS
Yasushige
Nakayama
JP
NHK
Max
Neuendorf
Germany
Fraunhofer IIS
Toshiyuki
Nomura
JP
NEC
Takeshi
Norimatsu
JP
Panasonic
Eunmi
Oh
KR
Samsung
Henney
Oh
KR
LG Electronics
Werner
Oomen
NL
Philips Applied Technologies
Pierrick
Philippe
FR
France Telecom R&D
Heiko
Purnhagen
SE
Dolby
Schuyler
Quackenbush
USA
ARL
Mohamad
Raad
Australia
RaadTech Consulting
Andreas
Schneider
DE
Dolby
Stephan
Schreiner
Germany
Fraunhofer IIS
Jeongil
Seo
KR
ETRI
Haiyan
SHU
Singapore
I2R
Ralph
Sperschneider
DE
Fraunhofer IIS
Herve
Taddei
DE
Huawei Technologies
Leonid
Terentiev
DE
Fraunhofer IIS
Mauri
Vaananen
FIN
Nokia Res. Center
David
Virette
FR
France Telecom R&D
Minjie
Xie
USA
Huawei
Hai Shan
Zhong
Singapore
Panasonic Singapore Laboratories
Huan
Zhou
SG
Panasonic Singapore Laboratories
Yongwei
Zhu
Singapore
Institute for Infcomm Resarch
138
Annex B Audio Contributions and Schedule
Day / Time
Task Group
Sunday
1000-1700
AhG: USAC
m16156
Progress of Technology Merge Between System 2 and
USAC RM
Taejin Lee
Max Neuendorf
Jeremie Lecomte
Kyeongok Kang
Bernhard Grill
X
m16173
Progress report on phase experiment for USAC
JungHoe Kim
Julien Robilliard
Eunmi Oh
Bernhard Grill
X
m16177
Progress report on unvoiced speech coding
Hosang Sung
Eunmi Oh
X
m16153
Proposed Corrections to WD and Reference Software
on Unified Speech and Audio Coding
Max Neuendorf
Philippe Gournay
Jérémie Lecomte
Markus Multrus
Stefan Bayer
Guillaume Fuchs
Ralf Geiger
Frederik Nagel
X
13001400
Lunch
m16079
Discussion on the Unified Speech and Audio Coding
Activity
Herve Taddei
Minjie Xie
Qing Zhang
X
m16110
Comment on the unified speech and audio coding
activity
Werner Oomen
X
m16118
Considerations on the development of common USAC
reference encoder
Dong Soo Kim
Sungyong Yoon
Hyun-Kook Lee
Jaehyun Lim
X
m16140
Core Experiment procedures and MPEG reference
software encoder
Kristofer Kjörling
Heiko Purnhagen
X
m16143
Proposed improvements for MPEG Audio Core
Experiment Methodology and Reference Software
Development
Pierrick Philippe
X
m16146
Comments on Core Experiments methodology for
MPEG USAC standardisation
Roch Lefebvre
Philippe Gournay
Redwan Salami
X
m16160
Thoughts on Core-Experiment Methodology
Bernhard Grill
Jürgen Herre
Ralf Geiger
Max Neuendorf
Markus Multrus
X
m16202
Comments on USAC Standardization Activities
Kei Kikuiri
Nobuhiko Naka
Kousuke Tsujino
X
1800-
Chairs Meeting
Monday
0900-1230
MPEG Plenary
1300-1400
Lunch
1400-1430
Audio Plenary
Welcome
139
Report on Sunday Chairs meeting
Review main tasks for the week
General documents
m16116
86th MPEG Audio Report
S. Quackenbush
X
m15952
Ad Hoc Group on Audio Standards Maintenance
R. Sperschneider
X
m15953
Ad Hoc Group on SAOC, USAC
S. Quackenbush
X
NB Position Papers
m16044
AUNB Comment on the unified speech and audio
coding activity
AUNB
X
m16045
AUNB Comment on the exploration on metadata driven AUNB
post-processing of audio
X
m15914
FNB Comment (from Busan) on USAC
FNB
m15879
Fin NB
m15911
CNB
1430-
USAC Reference Software
M16208
Proposal for the development of a common MPEG
Audio encoder for use in the CE phase
Mohamad Raad
X
m16166
On the Unified Speech and Audio Coding Activity
Mauri Väänänen
X
Discussion
1630-1800
MPEG-2, MPEG-4 and MPEG-7
m16036
Proposed Text of ISO/IEC 144963:2005/Amd.2:2006/DCOR4
Noboru Harada
Tilman Liebchen
Takehiro Moriya
Yutaka Kamamoto
X
m16056
Proposed Draft Corrigendum on AAC-ELD
Markus Schnell
Per Ekstrand
X
m16127
Proposed Corrigendum on MPEG-4 SLS Conformance
Ralf Geiger
X
M16200
Proposed BSAC Conformance Bitstreams for Terrestrial Miyoung Kim
DMB
Junghoe Kim
Eunmi Oh
X
m16141
Proposal for splitting the current AAC family profiles into Kristofer Kjörling
two
Andreas Schneider
X
1800-
HoD Meeting
Tuesday
0900-1300
MPEG-7
m16108
Study on ISO/IEC TR 15938-8:2002/FPDAM 4
0930-1300
SAOC
m16099
m16095
Matthias Gruhne
X
Report on corrections for the MPEG SAOC FCD text
and RM software
Jonas Engdegård
Heiko Purnhagen
Cornelia Falch
Leonid Terentiev
Andreas Hölzer
Oliver Hellmuth
Johannes Hilpert
Yang-Won Jung
Henney Oh
Jeroen Koppens
X
Information regarding CE on Low Delay MPEG SAOC
Jonas Engdegård
Heiko Purnhagen
X
140
Oliver Hellmuth
Johannes Hilpert
Maria Luis Valero
Andreas Hölzer
Markus Schnell
Leonid Terentiev
Erik Schuijers
Per Ekstrand
m16216
Subjective Evaluation of Low Delay MPEG SAOC
P. Philippe
X
m16096
Information regarding CE on Low Power MPEG SAOC
Jonas Engdegård
Heiko Purnhagen
Oliver Hellmuth
Leonid Terentiev
Erik Schuijers
X
m16084
CE on Residual Coding Process for Post Downmix Gain Kwang-Ki Kim
Jeongil Seo
Seungkwon Beack
Kyeongok Kang
MinsooHahn
X
m16086
Efficient inter-object relation indicator for SAOC
Zhong Haishan
Zhou Huan
Chong Kok Seng
Tomokazu Ishikawa
Takeshi Norimatsu
X
m16097
Information regarding mixing mode for the enhanced
Karaoke/Solo processing
Cornelia Falch
Leonid Terentiev
Johannes Hilpert
Oliver Hellmuth
X
1300-1400
Lunch
1400-1500
Future directions in Audio profiles
1400-1600
SAOC
m16098
Proposal for MCU functionality extension for the MPEG Leonid Terentiev
SAOC
Cornelia Falch
Oliver Hellmuth
X
m16100
Proposal for dynamic preset extension for the MPEG
SAOC
Heiko Purnhagen
Cornelia Falch
Leonid Terentiev
Oliver Hellmuth
Johannes Hilpert
Yang-Won Jung
Henney Oh
Jeroen Koppens
X
m16103
Consideration on User Interface in SAOC
Yang-Won Jung
Henney Oh
X
1600-1800
USAC Reference Encoder and the CE process
1800-
Chairs Meeting
Stephan Schreiner
Wolfgang Fiesel
Akshaya Thippur
X
Frans de Bont
Stefan Döhla
X
Wednesday
0900-1100
MPEG Plenary
1200-1300
Exploration: Metadata
m16144
Perspectives on Application Scenarios for PostProcessing Audio Metadata
1300-1400
Lunch
1400-1500
MPEG Surround – Joint with Systems
m16117
Thoughts on MPEG Surround signaling
141
Heiko Purnhagen
Alexander Gröschel
m16054
Scalable Audio and MP4
Stefan Doehla
Discussion: signaling audio profile in MP4FF
X
X
1500-1530
MPEG Surround
m16121
Further corrections to MPEG Surround text
Heiko Purnhagen
Jeroen Koppens
Matthias Neusinger
X
m16124
Corrections to MPEG Surround reference software
Heiko Purnhagen
Jeroen Koppens
Claus-Christian Spenger
Matthias Neusinger
X
1400-1500
SAOC
m16104
Proposal for adding information on object
characteristics in SAOC
Yang-Won Jung
Henney Oh
X
m16105
Proposal for including guideline information on the
rendering parameters in SAOC
Yang-Won Jung
Henney Oh
X
m16106
Comments on the enhanced karaoke mode in SAOC
Yang-Won Jung
Henney Oh
X
m16107
Proposed Audio Sequences for MPEG-D SAOC
Pierrick Philippe
Gregory Pallone
Marc Emerit
X
1600-1700
USAC
m16142
Core experiment proposal on the USAC eSBR module
Lars Villemoes
Per Ekstrand
Kristofer Kjörling
X
1900
Social (don’t be late!)
Thursday
0900-1300
USAC
m16147
Proposed Core Experiment on LPC Quantization for
USAC
Philippe Gournay
Bruno Bessette
Roch Lefebvre
Redwan Salami
X
m16162
Proposed Update of Arithmetic Coder Tables for USAC Guillaume Fuchs
Markus Multrus
x
m16055
Comments on WD of Unified Speech and Audio Coding Kihyun Choo
Junghoe Kim
Eunmi Oh
X
USAC MPEG Reference Encoder discussion
m16119
Proposed syntax revision on USAC RM0
Dong Soo Kim
Sungyong Yoon
Hyun-Kook Lee
Jaehyun Lim
X
m16120
Proposed syntax revision regarding SBR bitstream on
USAC RM0
Dong Soo Kim
Sungyong Yoon
Hyun-Kook Lee
Jaehyun Lim
X
m16122
Efficient signaling for FD frame on USAC RM0
Dong Soo Kim
Sungyong Yoon
Hyun-Kook Lee
Jaehyun Lim
X
m16123
Comment on random access issue on USAC RM0
Dong Soo Kim
Sungyong Yoon
Hyun-Kook Lee
Jaehyun Lim
X
m16125
Proposed syntax revision regarding window sequence
on USAC RM0
Dong Soo Kim
Sungyong Yoon
Hyun-Kook Lee
X
142
Jaehyun Lim
m16154
Proposed Update on USAC Bitstream Syntax
1300-1400
Lunch
1400-1800
TBD
1500-1530
Audio for HVC
1800-
Chairs Meeting
Audio plenary
Remarks on Thursday Chairs meeting
Recommendations for final plenary
Establishment of new Ad-hoc groups
AhG Mandates
Get document numbers
1000
Approve Responses to NB comments and Liaison
1030
Approval of output documents
Title:
N10xxx
File:
w10xxx (short title).doc (NOT *.docx!)
Zip:
w10xxx.zip
Review of Audio presentation to MPEG plenary
Agenda for next meeting
A.O.B.
Closing of the Audio meeting
1300-1400
Lunch
1400-
MPEG Plenary
X
X
Friday
0730-1300
Jérémie Lecomte
Max Neuendorf
Ralf Geiger
Markus Multrus
143
Annex C Task Groups
1.
2.
3.
4.
MPEG-2 and MPEG-4 audio, conformance, reference software
MPEG-D Spatial Audio Object Coding
MPEG-D Unified Speech and Audio Coding
Exploration: Meta-Data
144
Annex D Output Documents
No.
10373
10374
10375
10376
10377
10378
10379
10380
10381
10382
10383
10384
10385
10386
10434
10387
10388
10389
10390
10391
10392
10393
10394
10395
10396
10397
10398
10399
10413
10414
10415
Title
13818-4 Conformance
DoC on ISO/IEC 13818-4:2004/AMD 2:2005/DCOR 2, AAC
Conformance
ISO/IEC 13818-4:2004/AMD 2:2005/Cor 2, AAC Conformance
13818-7 Advanced Audio Coding
DoC on ISO/IEC 13818-7:2006/DCOR 1, AAC
ISO/IEC 13818-7:2006/Cor. 1, AAC
14496-3 Audio
DoC on ISO/IEC 14496-3:2005/DCOR. 6, AAC
ISO/IEC 14496-3:2005/Cor. 6, AAC
DoC on ISO/IEC 14496-3:2005/AMD 2:2006/DCOR 4, HE-AAC
V2 Profile and ALS
ISO/IEC 14496-3:2005/AMD 2:2006/Cor. 4, HE-AAC V2 Profile
and ALS
DoC on ISO/IEC 14496-3:2005/AMD 3:2006/ DCOR 2, SLS
ISO/IEC 14496-3:2005/AMD 3:2006/Cor. 2, SLS
DoC on ISO/IEC 14496-3:2005/AMD 9:2008/DCOR 1, AAC-ELD
ISO/IEC 14496-3:2005/AMD 9:2008/Cor. 1, AAC-ELD
ISO/IEC 14496-3:2009/FPDAM 1:200X HD-AAC Profile
Thoughts on MPEG Surround Signaling
Issues concerning frame lengths in the AAC family profiles
14496-4 Conformance testing
ISO/IEC 14496-4:2004/Cor. 6, AAC-LD
ISO/IEC 14496-4:2004/DCOR 7, Removal of Audio Conformance
DoC on ISO/IEC 14496-4:2004/AMD13:200x/DCOR 2, AAC-LD
bitstreams
ISO/IEC 14496-4:2004/AMD13:200x/Cor. 2, AAC-LD bitstreams
DoC on ISO/IEC 14496-4:2004/PDAM 36, AAC-ELD, OAFI and
additional AAC Conformance
14496-5 Reference Software
DoC on ISO/IEC 14496-5:2001/Amd.10:2007/COR 3, ALS and
SLS
ISO/IEC 14496-5:2001/Amd.10:2007/COR 3, ALS and SLS
Request for Subdivision of 14496, Audio Conformance
ISO/IEC 14496-26:2009, Audio Conformance
ISO/IEC 14496-26:2009/DCOR 1, ALS and SLS updates
ISO/IEC 14496-26:2009/FPDAM 1, AAC-ELD, OAFI additional
AAC and MPEG-1/2 on MPEG-4 Conformance
WD on additional BSAC conformance streams for T-DMB
15938-8 Extraction and Use of MPEG-7 Descriptions
DoC on ISO/IEC TR 15938-8:2002/PDAM 4, Extraction of audio
features from compressed formats
ISO/IEC TR 15938-8:2002/DAM 4, Extraction of audio features
from compressed formats
23003-1 MPEG Surround
ISO/IEC 23003-1:2007/DCOR 2, Misc. Corrections
ISO/IEC 23003-1:2007/AMD 2:2008/DCOR 1, Ref. Sw. Update
145
TBP Available
No
09/02/06
No
09/02/06
No
No
09/02/06
09/02/06
No
No
No
09/02/06
09/02/06
09/02/06
No
09/02/06
No
No
No
No
No
No
YES
09/02/06
09/02/06
09/02/06
09/02/06
09/02/06
09/02/06
09/02/06
No
No
No
09/02/06
09/02/06
09/02/06
No
No
09/02/06
09/02/06
No
09/02/06
No
No
No
No
No
09/03/06
09/02/06
09/03/06
09/03/06
09/02/20
No
09/02/06
No
09/02/06
No
09/02/06
No
No
09/02/06
09/02/06
23003-2 SAOC
10416 Study on ISO/IEC FCD 23003-2:200x, Spatial Audio Object
Coding
10417 Status and Workplan on SAOC Core Experiments
23003-3 Unified Speech and Audio Coding
10418 WD2 of USAC
10419 Workplan for USAC CEs
10420 MPEG Reference Encoder and the Audio CE Process
10421 Workplan on MPEG Reference Encoder
10422 Draft Revisions to MPEG Audio CE methodology
10423 Thoughts on Efficient Bitstream Syntax
Liaison Statements
10424 Response to DRM on MPEG-4 AAC Technology and Profiles
10425 Response to ETSI/EBU/CENELEC JTC on MPEG-4 AAC
Technology and Profiles
10426 Response to WorldDMB Forum on MPEG-4 AAC Technology and
Profiles
10427 Response to IEC TC100/TA4 on IEC CDV 61937-11 and 609583/Amd.1
Responses to National Bodies
10428 Response to AUNB Comments on USAC
10429 Response to AUNB Comments on MetaData
10430 Response to FR, FI and CN NB Comments on USAC
146
No
09/02/20
No
09/02/06
No
No
No
No
No
No
09/03/06
09/02/06
09/02/06
09/02/06
09/02/06
09/02/06
No
No
09/02/06
09/02/06
No
09/02/06
No
09/02/06
No
No
No
09/02/06
09/02/06
09/02/06
Annex E Agenda for the 88th MPEG Audio Meeting
Agenda Item
1. Opening of the meeting
2. Administrative matters
2.1. Communications from the Chair
2.2. Approval of agenda and allocation of contributions
2.3. Review of task groups and mandates
2.4. Approval of previous meeting report
2.5. Review of AhG reports
2.6. Joint meetings
2.7. Received national body comments and liaison matters
3. Plenary issues
4. Task group activities
4.1. MPEG-1, MPEG-2, MPEG-4, and MPEG-7
4.2. Spatial Audio Object Coding
4.3. Unified Speech and Audio Coding
4.4. Exploration: Meta-Data
5. Discussion of unallocated contributions
6. Meeting deliverables
6.1. Responses to Liaison and NB comments
6.2. Recommendations for final plenary
6.3. Establishment of new Ad-hoc groups
6.4. Approval of output documents
6.5. Press statement
7. Future activities
8. Agenda for next meeting
9. A.O.B
10. Closing of the meeting
147
Annex J – 3DG report
Source: Marius Preda, Chair
1
Opening of the meeting
1.1
Approval of the agenda
The agenda is approved.
1.2
Goals for the week
The goals of this week are:







Review SC-3DMC contributions and issue the associated CD and CE
Discuss the software status for SC-3DMC
Review the votes
Discuss FAMC, Scene Partitioning RefSoftware and Conformance
Status of software implementation in MP25 (especially the IC integration issues)
Compile and test reference software
Check the validity and re-generate when necessary conformance data for 3DGC
 Issue a new part of 14496 containing only 3DGC conformance
 Investigate future developments of MPEG 3D Graphics Compression
 Review Liaisons
1.3
Standards from 3DGC
4
4
2004 Amd.33
4
4
2004 Amd.34
4
4
2004 Amd.39
4
5
2001 Amd.22
4
5
2001 Amd.25
4
16
2006 Amd.4
4
16
200x 3rd Ed.
1.4
Multiresolution profile
conformance
3DGC Model
Conformance
Scene partitioning
conformance
3DG Compr. Model
RefSof
scene partitioning
RefSof
Scalable complexity
3D mesh coding
AFX
Room allocation
3DGC:
CM100
148
07/04
06/07
08/01
07/10
07/10
08/07
09/02
3
08/01
08/07
09/02
3
08/10
09/02
09/07
3
08/01
08/07
09/02
3
08/10
09/02
09/07
3
09/02
09/07
10/01
3
09/0
3
1.5
Allocation of contributions
N°
D1
m15945
m16151
m16187
Title
Schedule
D1
09:00~11:30
13:00~14:00
14:00~15:30
Monday
MPEG Plenary
Lunch Break
3DG Plenary
Roll call, Agenda, Goals, FAQ, etc.,
Report of AHG on 3DGC documents, experiments and software
maintenance
Results of voting
Liaison
Report on MXM latest developments
MXM API for 3D Graphics content creation
MXM use-case proposals for 3D services
Marius Preda
Patrick Gioia, Francisco Moran
Marius Preda
Francisco Moran
Marius Preda
Ivica Arsov,
Marius Preda
Francoise Prêteux
Patrick Gioia
15:30~16:00
16:00 – 18:00
Coffee Break
Scalable Complexity 3D Mesh Encoding (SC-3DMC)
m16196
Bitstream Syntax and Semantics for QBCR and SVA
m16149
Scalable Complexity Mesh Coding Benchmark
m16195
An Explanation of SVA and QBCR En-Decoding Algorithm
149
Seungwook Lee
Bonki Koo
Daiyong Kim
Kyoungsoo Son
Euee S. Jang
Benoit Le Bonhomme, Marius
Preda, Françoise Preteux
Kyoungsoo Son
Seungwook Lee
Bonki Koo
Daiyong Kim
Euee S. Jang
m16025
D2
Corrections to "WD3.0 of ISO/IEC 14496-16 AMD4, Scalable
Complexity 3D Mesh Coding"
Sergio Arnaldo
Francisco Morán Burgos
D2
09:00~12:00
Tuesday
Scalable Complexity 3D Mesh Encoding (SC-3DMC)
m16197
CE Report Version 3 on the SC3DMC
m16148
Attributes Encoding for TFAN
m16150
MMW.com API extension for 3D graphics attributes
Seungwook Lee
Bonki Koo
Daiyong Kim
Kyoungsoo Son
Euee S. Jang
Khaled Mamou
Titus Zaharia
Marius Preda
Françoise Preteux
Benoit le Bonhomme
Marius Preda
Francoise Prêteux
Marius Preda
SC-3DMC Editing Plan
Lunch Break
ok
Joint with System on MXM
Joint with System on MPEG-V
AFX Conformance and RefSoft
Filippo Chiariglione
Jean Gelissen
m16198
A Report on the Conformance Test of 3D Graphics Group
m16199
A Report on the Reference Software of SC3DMC
Daiyong Kim
Seungwook Lee
Kyoungsu Son
Preda Marius
A Report on the Reference
Software of SC3DMC
17:00~18:00
MP25
m16211
12:00~14:00
14:00 – 15:00
15:00 – 16:00
16:00~17:00
Source code for Interpolation Compression for MPEP-4 part 25
150
Sinwook Lee
Sowon Kim
Jeonghwan Ahn
Euee S. Jang
m16152
D3
Blagica Jovanova
Marius Preda
Françoise Preteux
Selecting elementary streams in MP25 RefSoft
Wednesday
MPEG Plenary
Lunch Break
Joint with Video on RVC
AFX: Mesh Grid
SC-3DMC
Editing of 14496-16 AMD 4 (check with Sergio if the updates are
considered)
AFX
Editing of AFX 3rd Edition
D4
Marco Mattavelli, Euee S. Jang
D3
09:00~11:00
13:00~14:00
14:00 – 15:00
15:00~16:00
16:00~17:00
all
17:00~18:00
Marius Preda
Thursday
Editing of 14496-27 (3DGC Conformance)
Avatar characteristics
Editing of 14496-16 AMD 4
Seungwook Lee
Jeong-Hwan Ahn
D4
9:00 – 12:00
Lunch Break
14:00 – 18:00
AFX Issues (Ref Soft)
3DGCM issues (IC in RefSoft)
Ref soft for SC3DMC
Liaison with X3D
3 DoC
D5
Friday
3DG output documents preparation
Liaison statements review
AhGs and resolutions
Lunch Break
MPEG Plenary
all
D5
09:00~12:00
12:00~14:00
14:00~
151
152
1.6
Attendance list
Name
Marius Preda
Francoise Preteux
Francisco Morán Burgos
Seung Wook Lee
Euee S. Jang
Byoungjun Kim
Mingxiao Chen
Jeong-Hwan Ahn
Country
France
France
Spain
Korea
Korea
Korea
Korea
Korea
2
General issues
2.1
General discussion
Company
Institut TELECOM
Institut TELECOM
UPM
ETRI
Hanyang Univ.
Hanyang Univ.
Hanyang Univ.
Samsung
2.1.1
Reference Software
It is recalled that the source code of both decoder AND encoder should be provided as part of the
Reference Software for all technologies to be adopted in MPEG standards. Moreover, not providing
the complete software for a published technology shall conduct to the removal of the corresponding
technical specification from the standard.
Currently almost all the AFX tools published in the second edition are supported by both encoder
and decoder implementation. Only exception is the MeshGrid tool; however commitment was
renewed by VUB.
2.1.2
Web site
OrangeLabs proposed a new version of the web site, now available at www.mpeg-3dgc.com. The
goal of the web site is to disseminate the group activities (documents, software and demonstration),
to maintain the FAQ and to be active in providing answers through the use of the Forum. 3DGC
contributors are kindly asked to check the web-site and provide comments.
3
Current Voting
Document title
ISO/IEC JTC 1/SC 29 N 9642 :ISO/IEC 144965:2001/FPDAM 22: Information technology -Coding of audio-visual objects -- Part 5: Reference
software AMENDMENT 22: Reference software for
3D Graphics Compression Model (3DGCM)
DoC
yes
Editor of DoC
Marius Preda
ISO/IEC JTC 1/SC 29 N 9640 :ISO/IEC 144964:2004/FPDAM 34: Information technology -Coding of audio-visual objects --Part 4:
yes
Marius Preda
153
Conformance testing AMENDMENT 34:
Conformance for 3D Graphics Compression Model
(3DGCM)
ISO/IEC JTC 1/SC 29 N 9638 :ISO/IEC 14496Yes
4:2004/FPDAM 33: Information technology -Coding of audio-visual objects -- Part 4:
Conformance testingAMENDMENT 33:
Multiresolution profile conformance
ISO/IEC JTC 1/SC 29 N 9817 :Combined PDAM
Non
Registration and PDAM Consideration Ballot
onISO/IEC 14496-4:2004/PDAM 39:Information
technology --Coding of audio-visual objects -- Part
4: Conformance testing,AMENDMENT 39:
Conformance testing for scene partitioning
ISO/IEC JTC 1/SC 29 N 9818 : Combined PDAM
Non
Registration and PDAM Consideration Ballot
onISO/IEC 14496-5:2001/PDAM 25:Information
technology --Coding of audio-visual objects -- Part
5: Reference software,AMENDMENT 25: Reference
software for scene partitioning
4
AFX (14496-16) related activities
4.1
AhG on AFX activities
Patrick Gioia
Report of AHG on 3DGC documents, experiments and software maintenance
Title
Authors Patrick Gioia, Francisco Moran
Summary See m15945
- use the reflector for exchanges on technology development
- Ivica Arsov is responsible for maintaining the 3DG reference software
Resolution
- Seungwook Lee is responsible for regenerating conformance and maintaining
it.
4.2
Scalable Complexity 3D Mesh Compression (14496-16 Amd.4)
Title
Authors
Bitstream Syntax and Semantics for QBCR and SVA
Seungwook Lee, Bonki Koo, Daiyong Kim, Kyoungsoo Son, Euee S. Jang
- Common header for the SC3DMC + specific header for each of the bitstreams
- Open issue: why is OptimizedforParallelDecoding should be signalized in the
bitstream
Summary - FDmode for SVA should go in the payload
- proposes the two manners of encoding the range: case 1 common range for X,
Y, Z; case 2 different ranges. Recommendation: use only case 1
Resolution Build a common header for SC3DMC.
Title
Scalable Complexity Mesh Coding Benchmark
154
Benoit le Bonhomme, Marius Preda, Francoise Prêteux
Results for all the SC3DMC tools (TFAN, QBCR and SVA) as well as 3DMC
Summary
and TG are presented.
Resolution Accepted.
Authors
Title
Authors
MMW.com API extension for 3D graphics attributes
Benoit le Bonhomme, Marius Preda, Francoise Prêteux
A new version of the API is available allowing to communicate the attributes
Summary
between encoding libraries and MMW.com
Resolution Accepted. This version should be used in the future experiments
Title
Authors
Summary
Resolution
Attributes Encoding for TFAN
Khaled Mamou, Titus Zaharia, Marius Preda, Françoise Prêteux
A syntax is proposed for encoding attributes in TFAN.
Accepted and updated as specified in the PDAM (output document)
An Explanation of SVA and QBCR En-Decoding Algorithm, CE Report
Version 3 on the SC3DMC
Authors Kyoungsoo Son, Seungwook Lee, Bonki Koo, Daiyong Kim, Euee S. Jang
The current benchmark results for QBCR do not emphasize the fact that QBCR
is a low complexity algorithm (the execution time for encoding and decoding
Summary
are similar to the one of other methods). The contribution presents an analysis
of the number of operations for QBCR and SVA.
Accepted. However the current implementation of the QBCR decoder should be
Resolution
revisited.
Title
Corrections to "WD3.0 of ISO/IEC 14496-16 AMD4, Scalable Complexity
3D Mesh Coding"
Authors Sergio Arnaldo, Francisco Morán Burgos
This contributions reports on several editorial and technical problems in the
Summary
current WD.
Resolution All the comments were addressed and solved.
Title
4.2.1
Scene partitioning (14496-11 Amd.6)
SP is followed as a joint activity between Systems and 3DGC. The technology is integrated in Part
11. There was no joint meeting with Systems on this topic during this meeting.
SP activity on conformance and reference software continued.
4.3
Maintenance
4.3.1
FAMC Conformance and Reference Software
FNB reports on a problem related to FAMC reference software, namely the usage of little endian
convention when writing the bitstream. This conducts to errors in parsing the FAMC bitstream
when encapsulated in MP4. Resolution: issue a corrigendum on FAMC ref soft and conformance
and ask the contributors to update the software and regenerate the bitstreams.
155
AFX 3rd Edition
4.3.2
The document was updated during the week. The final publication is delayed for April 2009 in
order to include current corrigendums.
4.4
Dataset and benchmarking
For Scalable Complexity 3D Mesh Coding, the www.MyMultimediaWorld.com will be used for
benchmarking.
4.5
Software
Title
Authors
Current status of MeshGrid compression software
A presentation of the current implementation (including a GUI) of MeshGrid
was demonstrated by VUB representatives. Some bugs still occur.
Submit the current version (command line) on the SVN and fix the bugs before
Resolution
the next meeting.
Summary
Title
Authors
A Report on the Conformance Test of 3D Graphics Group
Daiyong Kim, Seungwook Lee, Kyoungsu Son, Preda Marius
All the documents describing the conformance are identified as well as the
Summary
associated bitstreams
Accepted. Collect all the conformance documents within a new part of MPEG-4
Resolution (MPEG-4 Part 27), edit the resuest for subdivision and corrigendum for
removing the 3DG conformance from MPEG-4 Part 5.
Title
Authors
A Report on the Reference Software of SC3DMC
Seungwook Lee, Bonki Koo, Daiyong Kim, Kyoungsoo Son, Euee S. Jang
Current version of the IM1 available on the SVN is not complete (some projects
Summary
such as AACDecode) are missing.
A new version of the software was build during the week. This version should
Resolution
be commited to SVN and used for checking the conformance bitstreams.
4.6
Promotions
4.6.1
Title
Authors
Web Site
Status of www.mpeg-3dgc.com
Patrick Gioia
The web site is in beta version but no improvement was done since the last
Summary
meeting
Action Point:
Resolution Patrick Gioia will ask more actively contributions for demos from individual
parties.
156
4.7
Future
4.7.1
Metaverse)
MPEG-V - Information Exchange with Virtual Worlds (formally
Title
Joint meeting with Systems on MPEG-V
Authors Jean Gelissen
Summary The MPEG-V WD was reviewed.
Action Point:
Resolution The avatar information part should only integrate metadata and do links to the
media layer already specified by MPEG-4 tools (mesh, animation, texture).
Title
Authors
Avatar Characteristics
Jeong-Hwan Ahn
This contribution presents a model for representing avatar characteristics of
Summary various nature: skeleton configuration, animation types, mental state. The
definition is not complete but only at a concept level.
Action Point:
Resolution Review the existent literature and available systems (VHML, SL, IMVU, …)
for avatar metadata. Continue the discussions on the reflector.
4.7.2
MXM
Title
Authors
Report on MXM latest developments
Marius Preda
Informal discussion on possible impact of MXM activities on technologies
Summary
developed by 3DG group
Action Point:
Resolution Actively participate in proposing a complete API for accessing 3D graphics
tools.
Title
Authors
MXM use-case proposals for 3D services
Patrick Gioia
Three applications (Virtual Worlds, 3D GPS and 3D Yellow pages) were
Summary presented as well as their requirements with respect to the communication
protocol between client and server.
It was identified that MXM already proposes some communication protocols.
Resolution This should be extended to support communication needed for the above
mentioned applications.
Title
Authors
MXM API for 3D Graphics content creation
Ivica Arsov, Marius Preda, Francoise Prêteux
A complete implementation of the MXM 3D graphics engine was presented. It
Summary covers the encoding and decoding API. The documentation was done with
Doxygene.
Resolution Accepted.
157
4.7.3
Future directions of 3D Graphics Compression
Title
Authors
Joint meeting with video on RVC
Marco Mattavelli
This is a first presentation of the RVC framework in the 3DG group. The
architecture of the framework is generic (being applicable to all kind of media
Summary compression), the functional units implemented currently refer for video coding
(in a Video Tool Library).
Start investigation on a Video Graphics Library.
Resolution Make a collection of documentation related to RVC (tutorials, papers, software)
Contact person is (Shin Hwaseon, Ketu) L544@keti.re.kr
5
3D Graphics Compression Model (14496-25) activities
5.1
Software and conformance
Title
Authors
Selecting elementary streams in MP25 RefSoft
Blagica Jovanova, Marius Preda; Françoise Preteux
The selection of the XML elements in COLLADA for transmission to the
Summary corresponding encoders is explained. A new GUI is proposed as a wrapper for
MP25 encoder and decoder software.
Resolution Accepted, the GUI is considered as an utility part of the RefSoft. Commit the
new GUI on the SVN
Title
Authors
Source code for Interpolation Compression for MPEP-4 part 25
Sinwook Lee, Sowon Kim, Jeonghwan Ahn, Euee S. Jang
Initial implementation for the IC encoder and decoder is available as part of
Summary
MPEG-4 Part 25 reference software. Some bugs are identified.
Resolution Accept the reference software with the condition to fix the bugs.
6
Liaison
Title
Authors
Answer to liaison statement of SC24
All
SC24 informs SC29 on the FCD of the Second Edition of ISO/IEC 19775-2,
Summary
i.e., Part 2 (Scene Access Interface – SAI) of your Extensible 3D (X3D)
Include in the answer the fact that ISO/IEC 14496-11:2005, i.e., Part 11 (Scene
Resolution description and application engine) includes some components addressed by
the new SC24 standard.
158
7
Output documents and Resolutions of 3DGC
7.1
Part 4
7.1.1
Conformance testing
The 3DG subgroup recommends approval of the following documents
No.
Title
14496-4 Conformance testing
10320 DOCR on ISO/IEC 14496-4:2004/FPDAM 33 (Multi Resolution
Profile Conformance)
10321 DOCR on ISO/IEC 14496-4:2004/FPDAM 34 (3D Graphics Model
Conformance)
10322 Text of ISO/IEC 14496-4:200x/DCOR 7 (Removal of 3DG
Conformance)
7.2
Part 5
7.2.1
No.
10323
10324
10325
10326
10327
7.3
09/02/06
No
09/02/06
No
09/02/06
The 3DG subgroup recommends approval of the following documents
TBP Available
No
09/02/06
Yes
09/02/20
No
09/02/20
No
No
09/02/06
09/02/20
The 3DG subgroup recommends nominating Seung Wook Lee (ETRI) and
Khaled Mammou (Institut TELECOM) as editors of 14496-5:2001 AMD
27.
Management/Liaison
7.3.1
The 3DG subgroup recommends approval of the following documents
No.
Title
14496-16 Animation Framework eXtension (AFX)
10328 Answer to liaison from W3D
7.4
No
Reference Software
Title
14496-4 Reference Software
DOCR on ISO/IEC 14496-5:2001/FPDAM 22 (3DGCM Reference
Software)
Text of ISO/IEC 14496-5:2001/FDAM 22 (3DGCM Reference
Software)
Text ISO/IEC 14496-5:2001/FPDAM 25 (Scene Partitioning
Reference Software)
Request for Amendment: 14496-5:2001/PDAM27
Text of ISO/IEC 14496-5:2001/PDAM27 (SC3DMC RefSoft)
7.2.2
TBP Available
TBP Available
No
09/02/06
Part 16 Animation Framework eXtension (AFX)
7.4.1
The 3DG subgroup recommends approval of the following documents
No.
Title
14496-16 Animation Framework eXtension (AFX)
10329 Text of ISO/IEC 14496-16:2006/PDAM4 (Scalable Complexity 3D
Mesh Compression)
10330 CE on Scalable Complexity 3D Mesh Coding
159
TBP Available
No
09/02/20
No
09/02/06
10331 WD of ISO/IEC 14496-16 3rd Edition
7.4.2
7.5
Part 27
7.5.1
No.
10332
10333
10334
10433
10335
7.6
No
09/02/06
The 3DG subgroup recommends to add Khaled Mammou (Institut
TELECOM) to the editor list of 14496-16:2006 AMD 4.
3D Graphics Conformance
The 3DG subgroup recommends approval of the following documents
Title
14496-27 Conformance testing
Request for subdivision of ISO/IEC 14496-27
Text of ISO/IEC 14496-27:2009/FDIS (3DG Conformance)
Text of ISO/IEC 14496-27:2009/FPDAM1 (Scene partitioning
conformance)
Request for Amendment: 14496-27:2009/PDAM2 (SC3DMC
Conformance)
Text of ISO/IEC 14496-27:2009/PDAM2 (SC3DMC Conformance)
TBP Available
No
No
No
09/02/06
09/02/13
09/02/06
No
09/02/06
No
09/02/20
7.5.2
The 3DG subgroup recommends nominating Daiyong Kim (HYU) and
Francisco Morán (UPM) as editors of 14496-27:2009.
7.5.3
The 3DG subgroup recommends nominating Seung Wook Lee (ETRI) and
Khaled Mammou (Institut TELECOM) as editors of 14496-27:2009 AMD
2.
Establishment of 3DGC Ad-Hoc Groups
10336
Mandate:
AHG on 3DGC documents, software maintenance and core experiments
1. Conduct the experiments in Scalable Complexity Mesh Compression
2. Coordinate 3DGC related conformance and reference software
3. Maintain and edit 3DGC documents
4. Coordinate editing of the www.mpeg-3dgc.com web site
Chairmen: Francisco Morán Burgos
Patrick Gioia
Duration: Until 88th Meeting
Sunday before 88th meeting
Meetings
Reflector: mpeg-3dgc AT gti. ssr. upm. es
Subscribe: https://mx.gti.ssr.upm.es/mailman/listinfo/mpeg-3dgc
8
Closing of the Meeting
See you in Maui.
160
Download