The MPEG-7 Standard - A Brief Tutorial -
Ali Tabatabai
Sony US Research Laboratories
February 27, 2001

Outline
• Objectives of the MPEG-7 Standard
• Main Elements of MPEG-7
• Scope of MPEG-7
• MPEG-7 Application Areas
• MPEG-7's relation with other standards

Why do we need MPEG-7?
Need:
• Fast & Accurate Access
• Personalized Content Production and Consumption
• Content Management
• Automation
Support for Advanced Query:
• Visual
• Audio
• Sketch

MPEG: A Brief History (1)
MPEG: Moving Picture Experts Group
• Established in 1988
• ISO/IEC JTC1/SC29/WG11
• A Working Group of ISO/IEC in charge of the development of standards for coded representation of digital audio and video

MPEG: A Brief History (2)
• MPEG-1: Interactive CD and MP3 (11/1992)
• MPEG-2: DTV, STB, DVD (11/1994)
• MPEG-4: Web and Mobility (ver1: 09/1998, ver2: 11/1999)
• MPEG-7: ??? (08/2001)
• MPEG-21: Multimedia Framework (11/2001)

MPEG-7: What Is It?
THE MPEG-7 STANDARD
• IS NOT a COMPRESSION standard similar to MPEG-1/2/4 or their extension
• IS NOT a STANDARD for FEATURE EXTRACTION/MATCHING
Content Description of Various Audio Visual Information
Types of Audio Visual Information:
• Audio, Speech
• Moving video, still pictures, graphics
• Information on how objects are combined in scenes

MPEG-7: Application Areas
• Storage and retrieval of audiovisual databases (image, film, radio archives)
• Broadcast media selection (radio, TV programs)
• Surveillance (traffic control, surface transportation, production chains)
• E-commerce and Tele-shopping (searching for clothes / patterns)
• Remote sensing (cartography, ecology, natural resources management)
• Entertainment (searching for a game, for a karaoke)
• Cultural services (museums, art galleries)
• Journalism (searching for events, persons)
• Personalized news service on Internet (push media filtering)
• Intelligent multimedia presentations
• Educational applications
• Bio-medical applications

MPEG-7 Description Scope for AV Content
Description:
• Granularity: Low-level, High-level
• Form
• Access
• Classification
• Link
• Context

MPEG-7: Main Elements
• Descriptors (D): syntax and semantics of each feature representation
• Description Schemes (DS): structure and semantics of the relationships between components
• Description Definition Language (DDL):
  • creation of new DSs
  • modification/extension of existing DSs

MPEG-7: Major Functionalities
• Systems (ISO/IEC 15938-1)
• Description Definition Language (ISO/IEC 15938-2)
• Visual (ISO/IEC 15938-3)
• Audio (ISO/IEC 15938-4)
• Multimedia Description Schemes (ISO/IEC 15938-5)
• Reference Software (ISO/IEC 15938-6)

MPEG-7: Main Elements (2)
[Figure: DDL; Description Schemes DS1, DS2, DS3; Descriptors D1, D2, D3; Instantiation as XML (<Object> <Label/> <Definition/> ... </Object>); Systems (binary stream 0001100)]

MPEG-7: Systems
It defines tools to:
• provide for efficient storage and transport
• synchronize between content and description
• manage and protect intellectual property

MPEG-7: DDL and its Components
Description Definition Language:
• Creation of the Ds and DSs: XML Schema & MPEG-7 Extensions
• Instantiation of XML
XML Schema:
• Data types
• Simple and Complex types
• Elements, attributes
• Inheritance, Abstract types
MPEG-7 extensions:
• Array and Matrix data type

MPEG-7: Audio
• Sound Effects
• Music Instrument Timbre
• Spoken Content
• Melody Contour

MPEG-7: Visual (1)
• Color: quantization, dominant, scalable, color-structure, layout, GoF/GoP
• Texture
• Shape: region-based, contour-based, 3D
• Motion: camera motion, motion trajectory, parametric motion, motion activity

MPEG-7: Visual (2)
• Localization: spatio-temporal
• Others: face recognition

MPEG-7: Basic Visual Structures
• Grid Layout
• 2D-3D Multiple View
• Time Series
• Spatial 2D Coordinates
• Temporal Interpolation

Low level Audio Visual descriptors
Video segments:
• Color
• Camera motion
• Motion activity
• Mosaic
Still regions:
• Color
• Shape
• Position
• Texture
Moving regions:
• Color
• Motion trajectory
• Parametric motion
• Spatio-temporal shape
Audio segments:
• Spoken content
• Spectral characterization
• Music: timbre, melody

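The instantiation step in the Main Elements (2) diagram, where a description is written out as an XML fragment with an Object element holding Label and Definition children, can be sketched roughly as follows. This is only an illustration under stated assumptions: the function names, the sample values, and the `DominantColor` element are hypothetical and do not follow the normative MPEG-7 schema.

```python
import xml.etree.ElementTree as ET


def build_description(label, definition, dominant_color):
    """Build an illustrative description instance.

    Element names mirror the <Object>/<Label/>/<Definition/> fragment on the
    slide; DominantColor is a hypothetical stand-in for a low-level
    Descriptor (D) and is NOT the normative MPEG-7 element.
    """
    obj = ET.Element("Object")
    ET.SubElement(obj, "Label").text = label
    ET.SubElement(obj, "Definition").text = definition
    color = ET.SubElement(obj, "DominantColor")
    # Store the assumed RGB triple as space-separated integers.
    color.text = " ".join(str(c) for c in dominant_color)
    return obj


def to_xml(element):
    """Serialize the description to a text (XML) string."""
    return ET.tostring(element, encoding="unicode")


description = build_description(
    "Anchor", "News presenter in studio", (200, 30, 30)
)
xml_text = to_xml(description)
print(xml_text)
```

In this sketch the tree of elements plays the role of a Description Scheme instance and the leaf values play the role of Descriptor values; the textual form printed here is what a Systems layer would then encode for storage or transport.
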
MPEG-7: MMDS Basic Elements
Basic elements:
• Schema tools: Root, Top-level elements, Packages
• Datatype & structures
• Link & media localization: Time, Duration, Media locators
• Basic DSs: Language, Annotation, Person, Place

MPEG-7: Content Management and Description
Content management:
• Creation & production: Title, Creator, Creation location & date, Purpose, Classification, Genre, Review, Parental guidance, etc. (Author generated)
• Media: Format, Coding, Instances, Identification, Transcoding Hint, etc. (Several instances)
• Usage: Rights holder, Access rights, Usage Record, Financial aspects, etc. (Evolution)
Content description:
• Structural aspects: viewpoint of the structure (Segments)
  • Spatial / temporal structure
  • Audio & video low-level Ds
  • Elementary semantic information
• Conceptual aspects: viewpoint of conceptual notions
  • Events, objects, abstract concepts, and their relation
Basic elements: Schema tools, Datatype & structures, Link & media localization, Basic DSs

Example of Segment trees
• SR1: Creation, Usage meta information; Media description; Textual annotation; Color histogram, Texture
• SR2: Shape; Color Histogram; Textual annotation
• SR3: Shape; Color Histogram; Textual annotation
• SR4: Shape; Color Histogram; Textual annotation
• SR5: Shape; Textual annotation
• SR6: Color Histogram; Textual annotation
[Figure labels: Background, Foreground]

Time Axis Segment Tree
Segment Tree (along the time axis: Shot1, Shot2, Shot3):
• Segment 1
  • Sub-segment 1
  • Sub-segment 2
  • Sub-segment 3
  • Sub-segment 4
• Segment 2
• Segment 3
• Segment 4
• Segment 5
• Segment 6
• Segment 7
Semantic DS (Events):
• Introduction
  • Summary
  • Program logo
  • Studio
  • Overview
  • News Presenter
• News Items
  • International
    • Clinton Case
    • Pope in Cuba
  • National
    • Twins
  • Sports
• Closing

MPEG-7: Navigation and Access
Navigation & Access:
• Summary: efficient support of discovery, browsing, navigation, visualization
• Variation: substitution of the original content; adaptation to terminal, network, or user preferences
Content management: Creation & production, Media, Usage
Content description: Structural aspects, Conceptual aspects
Basic elements: Schema tools, Datatype & structures, Link & media localization, Basic DSs

MPEG-7: Hierarchical summary
[Figure: a Hierarchical Summary with two HighlightLevels, one holding three Highlight Segments and one holding four Highlight Segments, referring to the A-V Data]

MPEG-7: Sequential summary
[Figure: a Sequential Summary made up of Frame Properties, Text Properties, and Sound Properties, referring to the A-V Data]

MPEG-7: Variation
Universal Multimedia Access
• Adapt delivery to network and terminal characteristics (QoS)

MPEG-7: Content Organization
Content organization: description and organization of collections of documents
• Collection & Classification
• Analytic Model
Navigation & Access: Summary, Variation
Content management: Creation & production, Media, Usage
Content description: Structural aspects, Conceptual aspects
Basic elements: Schema tools, Datatype & structures, Link & media localization, Basic DSs

MPEG-7: Collection

MPEG-7: User Interaction
User Interaction:
• User preferences: user identification and preferences; filtering, search and browsing
Content organization: Collection & Classification, Analytic Model
Navigation & Access: Summary, Variation
Content management: Creation & production, Media, Usage
Content description: Structural aspects, Conceptual aspects
Basic elements: Schema tools, Datatype & structures, Link & media localization, Basic DSs

MPEG-7: Its Relation with other standards
AHG on "Metadata harmonization":
• SMPTE: Metadata dictionary, KLV encoding
• Dublin Core Metadata Initiative
• European Broadcasting Union
AHG on TV AnyTime Application
Large number of Liaisons:
• SMPTE
• Dublin Core
• W3C (XML Schema)
• etc.

MPEG-7: TimeLine - The Work Plan
• 1996: Competition (Divergence)
  • Individual work
  • Definition scope and r
• 1998: Call for proposals; Convergence
• 1999: Working draft
• 2000: Committee draft, Final committee draft
• 2001: Draft international standard, International standard

Conclusions on AV Content Description and MPEG-7
• MPEG-7: AV content description for interoperable applications
• Description Definition Language: XML Schema (flexibility) + Binary version (efficiency)
• Description Schemes:
  • Library of description tools
  • Covers a wide range of generic needs
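
The segment tree from the Time Axis Segment Tree slide can be sketched as a small recursive data structure. This is an illustrative sketch only: the `Segment` class, the `find_at` helper, the time values in seconds, and the pairing of event labels with particular sub-segments are assumptions made for the example, not part of the MPEG-7 schema or of the original figure.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Segment:
    """One node of a temporal segment tree (hypothetical structure)."""
    label: str
    start: float          # start time in seconds (assumed unit)
    end: float            # end time in seconds, exclusive
    annotation: str = ""  # free-text annotation, e.g. an event name
    children: List["Segment"] = field(default_factory=list)

    def find_at(self, t: float) -> List[str]:
        """Return labels from this node down to the deepest node covering t."""
        if not (self.start <= t < self.end):
            return []
        path = [self.label]
        for child in self.children:
            sub_path = child.find_at(t)
            if sub_path:
                path.extend(sub_path)
                break  # sibling segments are assumed not to overlap
        return path


# Made-up timings; the annotations reuse event names from the slide.
program = Segment("News program", 0, 120, children=[
    Segment("Segment 1", 0, 60, "Introduction", [
        Segment("Sub-segment 1", 0, 10, "Summary"),
        Segment("Sub-segment 2", 10, 20, "Program logo"),
        Segment("Sub-segment 3", 20, 40, "Studio"),
        Segment("Sub-segment 4", 40, 60, "Overview"),
    ]),
    Segment("Segment 2", 60, 120, "News Presenter"),
])

path_at_15 = program.find_at(15)
path_at_500 = program.find_at(500)
print(path_at_15)
print(path_at_500)
```

A query by time walks from the root to the most specific segment, which is the kind of structural navigation that the Structural aspects part of a description is meant to support; a real description would attach Descriptors and semantic links to each node rather than a plain string.
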