MPEG for the past, present and future of television Leonardo Chiariglione Keynote Speech at ATSC Annual Meeting Washington, DC – 2012/05/08 MPEG for the past of television System Transport Stream DSM-CC Timed metadata Video MPEG-2 High Profile Audio MPEG-1 Audio Layer II MPEG-2 AAC MPEG for the past, present and future of TV 2012/05/02 2 An assessment MPEG has provided the means for the television business on air, cable and satellite to migrate from the analogue to the digital age It has been a collective efforts Representatives from the entire spectrum of the broadcasting world provided their requirements Packages of patent licences were created (outside of MPEG) Today there is virtually no broadcasting system that is not based, at least partially, on MPEG standards MPEG for the past, present and future of TV 2012/05/08 3 MPEG for the present of television Systems Carriage of MPEG-4 on MPEG-2 TS Dynamic Adaptive Streaming over HTTP (DASH) Video MPEG-4 Advanced Video Coding (AVC) 3D extensions Audio Various forms of Advanced Audio Coding (AAC) Universal Speech and Audio (USAC) MPEG for the past, present and future of TV 2012/05/08 4 An assessment The deployment of 2nd generation digital broadcasting systems continues successfully Use of the AVC on MPEG-2 TS allows packing twice as many programs as MPEG-2 Video in an analogue channel With IPTV the notion of broadcasting is expanding MPEG is helping the broadcasting industry To enrich the broadcasted user experience Through standard technologies that serve multiple fields MPEG for the past, present and future of TV 2012/05/08 5 MPEG for the future of television Well, that’s what I am supposed to talk about… MPEG for the past, present and future of TV 2012/05/08 6 Broadcasting is a great business to be in… Broadcasting is a proven business with a constant revenue flow Many technologies appear that Can help new competing businesses but Can also be used to extend the broadcasting business MPEG is a neutral body devoted to the development of standard technologies for user communities to exploit MPEG for the past, present and future of TV 2012/05/08 7 MPEG-H: a video standard for future broadcasting ISO/IEC 23008 High Efficiency Coding and Media Delivery in Heterogeneous Environments A suite of standards for media coding and delivery Currently scheduled to appear in 3 parts MPEG Media Transport High Efficiency Video Coding 3D Audio Timeline Part CD DIS FDIS MMT 12/07 13/01 13/07 HEVC 12/02 12/07 13/01 3D Audio 14/04 14/10 15/01 MPEG for the past, present and future of TV 2012/05/08 8 Service composition in MMT MMT Package composition information friendly with html5 <subtitle> Bird is flying… UD contents 3D Earth Bird is flying… legacy broadcasting MPEG for the past, present and future of TV future broadcasting 2012/05/02 9 Smart delivery in Clouds video file Web Clouds pull pull MMT Encapsulator - Service Component List Component location Composition relationship QoE requirement How to deliver How to consume MMT Frame - Configuration - Adaptive delivery - Sync, Delay, Loss control 2012/05/08 10 Mash-up service Audio Video1 Video2 widget Web page MMT assets Service 1 (MMT Package1) MPEG for the past, present and future of TV Service 2 (MMT Package2) 2012/05/08 11 Second screen management MMT Package composition information <device 1> … <device2> … Video content on 2nd device Additional Information with Live Content (HTML) MPEG for the past, present and future of TV 2012/05/08 12 MPEG-H part 2 High Efficiency Video Coding Primary goal: ≥ 50% better compression than AVC For large displays (e.g. 4kx2k) For mobile (e.g. WVGA) Developed jointly with ITU-T by Joint Collaborative Team (JCT) on Video Coding Technically in the same stream of past MPEG Video Coding standards (AVC and prior standards) Currently at Committee Draft level MPEG for the past, present and future of TV 2012/05/08 13 HEVC vs AVC Subjective Performance Measurements Preliminary subjective tests carried out for HEVC Committee Draft Reasonably small and non-overlapping confidence intervals Average rate savings at same (interpolated) MOS points HEVC with 67% lower rate in class B (Full HD) HEVC with 49% lower rate in class C (WVGA) HEVC with 58% lower rate overall MPEG for the past, present and future of TV 2012/05/08 14 A performance sample MPEG for the past, present and future of TV 2012/05/08 15 For those technology aware… Block-based/Variable block sizes Block motion compensation Fractional-pel motion vectors Spatial intra prediction Spatial transform of residual difference Integer-based transform designs Arithmetic or VLC-based entropy coding In-loop filtering to form final decoded picture MPEG for the past, present and future of TV 2012/05/08 16 Where we are (May 2012) Activity: level of 1,000 documents per meeting crossed Very diverse participation from industry/academia Significant technical advance over prior standards Challenge: computational/implementation complexity Deliverables: Spec Reference software Conformance Currently only one profile MPEG for the past, present and future of TV 2012/05/08 17 HEVC Scalability Extensions/1 Serve the needs of heterogeneous environment of future digital TV distribution Multiple devices: SHDTVs, set-top boxes, tablets, smart phones, PCs etc. Wide range of processing powers, display sizes and power consumption needs Multiple networks: Cable, Satellite, Terrestrial, CDN, Cable Modem, xDSL, WiFi, 4G Wide range of channel bandwidths and QoS Multiple protocols: MPEG-2 TS, DASH, HLS, RTP, UDP Multiple services: Broadcasting, On demand, Streaming, Over The Top MPEG for the past, present and future of TV 2012/05/08 18 HEVC Scalability Extensions/2 Options for backward compatible migration paths From 720p60 to 1080p60 From1080p60 to 4kx2kx60 From AVC to HEVC Time Line May 2012 Preliminary Call for Proposals (public) July 2012 Final Call for Proposals (public) October 2012 Evaluation of proposals & start of collaborative design phase Development of the time line associated with standardisation phases (WD, CD/PDAM, FDIS/FDAM, DIS/AMD) MPEG for the past, present and future of TV 2012/05/08 19 Service Transitions 1950s: Color TV Analog, backward compatible 1990s: Digital TV New infrastructure required Transitions from SD to HD 2010s: 3D Mixed results from services introduction is Not a single format across all services MPEG for the past, present and future of TV 2012/05/08 20 Current Status of 3D Video Increasing investments in 3D production and services Increasingly capable 3D displays in the market (many competing and emerging technologies) Market is primarily stereo (mix of different formats being deployed right now) Focus of current MPEG work: development of a new 3D format and associated compression techniques that could facilitate generation of multiview output MPEG for the past, present and future of TV 2012/05/08 21 Existing 3DV Coding Formats Left Right Pack pixels from left and right views into a single frame (loss of resolution) Leverage existing infrastructure and equipment, only require additional signaling to de-interleave frame Embraced by broadcasters for first phase of 3D broadcast services Full-resolution coding of stereo and MV video as extension of AVC High coding efficiency achieved via inter-view prediction techniques 2D compatibility supported Adopted as format for 3D Blu-ray Disc Being considered for second phase of broadcast standards Left View Right View Effective prediction between views for high compression performance MPEG for the past, present and future of TV 2012/05/08 22 Targets of Future 3DV Format Stereoscopic displays • Variable stereo baseline • Adjust depth perception Left Right Limited Camera Inputs Data Format Data Format Constrained Rate (based on distribution) Auto-stereoscopic N-view displays MPEG for the past, present and future of TV • Wide viewing angle • Large number of output views 2012/05/08 23 3DV Reference Framework Limited # Video Inputs (e.g., 2 or 3 views) Depth Estimation Video/Depth Encoder Decoder and View Synthesis Larger # Output Views 1010001010001 + Binary Representation & Reconstruction Process MPEG for the past, present and future of TV 2012/05/08 24 Overview of Call For Proposals Test Material & Conditions 8 test sequences (1920x1080 & 1024x768) 4 target bit rates per sequence 2-view and 3-view test scenarios AVC and HEVC compatible test categories 23 responses (12 AVC + 11 HEVC) Evaluation in Nov 2011 Objective quality (PSNR of synthesized views) Subjective assessment (stereo and auto-stereo) MPEG for the past, present and future of TV 2012/05/08 25 3DV Testing Scenarios 2-view Configuration Left Depth Estimation 3D Video Codec Multiview Rendering virtual view Output to Stereo Display Right 3-view Configuration Left Output to Auto-Stereo N-view Display 3D Video Codec Center Right Depth Estimation MPEG for the past, present and future of TV Multiview Rendering virtual views 2012/05/08 26 Tool Categories Texture coding Independent of depth E.g., inter-view prediction of color view, inter-view prediction of motion parameters and residual data Using depth data E.g., view synthesis prediction, motion prediction Depth coding Independent of texture E.g., depth modeling modes, weighted prediction, reduced resolution coding Using texture data E.g., motion parameter inheritance, intra prediction MPEG for the past, present and future of TV 2012/05/08 27 3DV Current Status Extensive subjective evaluation of 20+ proposals to the 3DV CfP Considered both AVC and HEVC compatible Many new coding tools proposed for MV texture and depth coding Substantial rate savings compared to capabilities of existing standards AVC and HEVC extensions standardisation under wayd Add support for depth New AVC-based coding tools MV extensions of HEVC Hybrid architectures MPEG for the past, present and future of TV 2012/05/08 28 Hybrid Architectures From a pure compression efficiency point of view, it is always best to use the most advanced codec However, when introducing new services, providers must also consider capabilities of existing receivers and transition plan Use Case: Many terrestrial broadcast systems based on MPEG-2 US cable systems based on mix of MPEG-2 and AVC May not be easy to simply switch codecs in near term MPEG for the past, present and future of TV 2012/05/08 29 Multiview Extensions of HEVC HEVC Stereo/MV extensions being considered Extensions expected to be completed approximately 12 months after the base specification, i.e., Jan 2014 Reportedly simple extensions of HEVC can achieve 3040% bit rate reduction compared to HEVC simulcast Further gains expected if block-level changes to codec are considered MPEG for the past, present and future of TV 2012/05/08 30 MPEG-H part 3: 3D Audio Well, there is no Systems and Video without Audio… MPEG for the past, present and future of TV 2012/05/08 31 3D Audio requirements/1 High quality: quality of decoded sound perceptually transparent Localization and Envelopment: Accurate sound localization and very high sense of sound envelopment within a targeted listening area (“sweet spot”) Flexible Loudspeaker Placement: the transmitted audio program to be mapped to a rendering setup in which loudspeakers are in alternate locations Interactivity: Interactive modification of the sound scene rendered from the coded representation MPEG for the past, present and future of TV 2012/05/08 32 3D Audio requirements/2 Rendering on setups with fewer loudspeakers (including headphones): ability to derive a signal from the transmitted program material for reproduction with fewer loudspeakers Audio/visual alignment & consistency: ability to adapt the rendered acoustic scene to be consistent with the visual Efficiency for decoding on different setups: ability to be rendered on all reproduction loudspeaker setups or headphone MPEG for the past, present and future of TV 2012/05/08 33 3D Audio requirements/3 Appropriate computational complexity and system latency for the target application (Broadcasting, Spatial two-(n-)way communication / telepresence Transcoding for low bandwidth devices: A lower bandwidth signal can be extracted from the original program material Backward compatibility: (e.g. to 5.1 channel programs and decoders or transmission of HRTF encoded signals) MPEG for the past, present and future of TV 2012/05/08 34 Envisioned Architecture 22.2 Loudspeakers Home Theatre Spatialized on Headphones 22.2 Channel Audio Program Audio Objects 3D Audio Encoder 500 kb/s bitstream 3D Audio Encoder In-Frame Speakers Tablet TV Spatialized on Headphones Audio Scene Transcoder Inputs Audio Channels Channel – “normal” content Audio Objects • 64 kb/s bitstream – “helicopter overflight” 500 kb/s primary channel 64 kb/s cellular channel Smart Phone TV Spatialized on Headphones • Outputs – – Loudspeakers Headphones • – Spatialized sound In-frame loudspeakers • “Sound Bar” Audio Scene Ambisonics MPEG for the past, present and future of TV 2012/05/08 35 Audio, part of an Audio-visual Experience What is new in video? Larger, high-resolution displays Greater sense of envelopment Possibly closer viewing distance More efficient compression Wireless transmission to portable devices Immersive and enveloping audio experience MPEG for the past, present and future of TV 2012/05/08 36 Use Case 1 Home Theatre Many loudspeakers 10.1 22.2 “3D” Experience Height loudspeakers for greater envelopment Sense that audio objects are near or distant When near, they are consistent with video image MPEG for the past, present and future of TV 2012/05/08 37 22.2 multichannel sound system (NHK) TpFC TpFL TpFR Top layer 9 channels TpC TpSiL TpSiR TpBC TpBR TpBL FLc FRc FC FR FL Middle layer 10 channels TV Screen SiL SiR BtFC BtFL BtFR LFE1 LFE2 BR BL BC MPEG for the past, present and future of TV Bottom layer 3 channels LFE 2 channels 2012/05/08 38 Home Theater – Issues Maintain compatibility with legacy systems: 5.1, stereo Will content providers really step up to 22.2? Will consumers adopt 22.2?: 2 front high How to compress 22.2 for transmission? MPEG AAC 64 kb/s/channel: -> 1.5 mb/s How to render for legacy setups? Relatively slow-growth market MPEG for the past, present and future of TV 2012/05/08 39 Use Case 2 Personal TV Small but high-resolution display “super-tablet” Local wireless communications To cable or fiber home hub Possible audio presentation Headphones Loudspeakers around perimeter of display Issue: To what extent can the user get an enveloping experience from only “front” loudspeakers? MPEG for the past, present and future of TV 2012/05/08 40 Use Case 3 Mobile TV Hand-held display: Smartphone Headphone listening: Stereo, perhaps with binauralisation Fastest growing market: Quick rollout and adoption of standard What is new? USAC for stereo; MPEG Surround for binauralisation What is the “Wow” factor? To spur adoption of new technology MPEG for the past, present and future of TV 2012/05/08 41 User Experience Spatialized on headphones Home Theatre Tablet TV MPEG for the past, present and future of TV 2012/05/08 42 Flexible Rendering Content providers create 22.2 program Consumed on stereo, 5.1, 10.1, 22.2 layouts Consumed on “wrong” layouts Mis-positioned surrounds Missing surrounds Non-standard layouts 2 front high, 5.1 mid MPEG for the past, present and future of TV 2012/05/08 43 3D Audio Test Platform NHK “Loudspeaker Array Frame” (LAF) Suggestions for Alternative platform for assessing “Home Theatre” use case Model for assessing “Personal TV” use case MPEG for the past, present and future of TV 2012/05/02 44 Summary Provide compelling immersive audio experience for audio/visual programs Create content once, present on many different loudspeaker layouts or on headphones Provide high compression and high-quality audio presentation MPEG for the past, present and future of TV 2012/05/08 45 MPEG-V: Humans have more than two senses MPEG for the past, present and future of TV 2012/05/08 46 Augmented Reality Application Format (ARAF) Remote Real World Environment Remote Sensors & Actuators Local Real World Environment Local Sensors & Actuators MARBle User ARAF file or stream Media Servers MPEG for the past, present and future of TV Service Servers 2012/05/08 47 Augmentation Region in ARAF AR service provider A User A Broadcaster User B Augmentation Region MPEG for the past, present and future of TV AR service provider B 2012/05/02 48 The past, the present and the future, again 25 years ago MPEG selected the digital media way bringing innovation and interoperability …and thrived This year the broadcasting industry has started a move in future broadcast services with interoperability at its core …and it is bound to thrive MPEG looks forward to continuing a cooperation with the broadcasting industry providing the necessary standard technologies MPEG for the past, present and future of TV 2012/05/08 49 http://mpeg.chiariglione.org/ MPEG for the past, present and future of TV 2012/05/08 50