Video Fundamentals September 9, 1999 Lawrence A. Rowe University of California, Berkeley URL: http://www.BMRC.Berkeley.EDU/~larry Copyright @1999, L.A. Rowe Outline • History • Broadcast System Elements • Scanning and Interlace • Color and Gamma • Chroma Subsampling • Representations Multimedia Systems and Applications 2 History • 1839: Daguerreotype Cameras • 1893: Telephone Audio Broadcasting (Puskas) • 1895: Wireless Communication (Marconi, Popov) • 1895: Film Presentation (Lumiere Brothers) • 1919: Radio Broadcasting (Holland, Canada) • 1934: US establishes FCC • 1935: TV Broadcasting (Germany, Britain) • 1941: US B&W TV ADD: telephone invention, founding of ATT, tv inventions, first radio networks/stations Multimedia Systems and Applications 3 History (cont.) • 1951: Videotape Recorder (Bing Crosby Enterprises) • 1953: US Color TV (NTSC) • 1963: Geostationary Satellites • mid ‘70s: Fiber Optic Transmission - cable • 1985: FCC establishes ATSC - standard by 1993? • 1989: Analog HDTV Broadcasting (Japan) • 1996: ATSC Standard Adopted ADD: first ntsc tv broadcasts, 1st cable system, consumer vcr, tivoli/replay device, direct tv, first hdtv broadcasts Multimedia Systems and Applications 4 Standards Groups • ITU-T – ITU Telecommunications Formerly CCITT • ITU-R – ITU Radiocommunications Formerly CCIR • FCC • SMPTE - Society of Motion Picture and Television Engineers … and many more! Multimedia Systems and Applications 5 Receiver Transmitter Broadcast System Goals: 1. Efficient use of bandwidth 2. High viewer perception of quality Multimedia Systems and Applications 6 Scanning and Interlace • Transmission is continuous signal Transmitter captures images and encodes Receiver decodes for display • Image traced out line-by-line Left-to-right top-down scanning vertical/horizontal blanking interval • Signal is interlaced Improves perception of motion (?) Alternative is progressive scanning Multimedia Systems and Applications 7 Video Display Scanning Amplitude Cathode Time • Video composed of luma and chroma signals • Composite video combines luma and chroma • Component video sends signals separately Multimedia Systems and Applications 8 Camera Operation Beam Splitter G B Camera Tubes Luma Encoder Zoom Lens Color Filters R Chroma • Camera has 1, 2, or 3 tubes for sampling • More tubes (CCD’s) and better lens produce better pictures Multimedia Systems and Applications 9 Direct View CRT • Three guns (RGB) energize phosphors Varying energy changes perceived intensity Different energies to different phosphors produces different colors Phosphors decay so you have to refresh • Different technologies Shadow mask (delta-gun dot mask) PIL slot mask Single-gun (3 beams) aperture-grille (Trinitron) Multimedia Systems and Applications 10 Aperture-grille –vs- Shadow Mask R G B Multimedia Systems and Applications R B G 11 Scanning Notation • Lines/”frame rate” specification NTSC PAL 525/59.94 625/50 • ATSC – everything is variable 1080i 720p 1920x1080 interlaced scanning 1280x720 progressive scanning • Why 59.94? Avoids interference problem between color and sound subcarrier Multimedia Systems and Applications 12 Interlaced Fields Signal Format Line 1 --- vertical blanking Line 21 --- Raster Format Field 1 1 2 3 485 Line 263 --- vertical blanking Line 283 --- ... 485 2 4 484 ... 485 Field 2 ITU-R Rec. 601: 720x483 Line 525 --Multimedia Systems and Applications 484 13 Scanning (525/59.94) Total Size square pixel 601 4fsc Active Active Active Multimedia Systems and Applications Active Area 780X525 644X483 858X525 720X483 910X525 757X483 14 Aspect Ratio/Refresh Rate • Aspect ratio Conventional TV is 4:3 (1.33) HDTV is 16:9 (2.11) Cinema uses 1.85:1 or 2.35:1 • Refresh Rate NTSC is 60Hz (59.94Hz) PAL is 50Hz Cinema is 48Hz (but still only 24 fps) Multimedia Systems and Applications 15 NTSC Video (525-lines, 60-fields/sec) • 525 scan lines repeated 29.97 times per second (i.e.33.37 msec/frame) • Interlaced scan lines divide frame into 2 fields each 262.5 lines (i.e.16.68 msec/field) • 20 lines reserved for control information at the beginning of each field Only 483 lines of visible data Laserdisc and S-VHS display around 420 lines (perception) Normal broadcast TV displays around 320 lines (“) • Line lasts 63.6 usec(10.9 usec blanked) Multimedia Systems and Applications 16 PAL Video (625-lines, 60-fields/sec) • 625 scan lines repeated 25 times per second (i.e. 40 msec/frame) • Interlaced scan lines divide frame into 2 fields each 312.5 lines (i.e. 20 msec/field) • Approximately 20% more lines than NTSC • NTSC vs. PAL roughly same bandwidth Multimedia Systems and Applications 17 Color Perception • Color is perceived lightwave 400nm to 700nm received at retina Humans more sensitive to brightness than color • Retina composed of cones and rods Cones respond to different frequencies (RGB) Rods measure brightness at low light levels (i.e., nightvision) • CIE established standards for color CIE XYZ, CIE xyY Linear RGB Multimedia Systems and Applications 18 Gamma • CRT is inherently non-linear Display changes based on voltage driving it • Human vision is also non-linear Lightness sensation is a power function of intensity (y=xw) • Serendipity happens… Non-linear CRT close to inverse human lightness Coding intensity into a gamma corrected signal maximizes perceptual image • Y -vs- Y’ Multimedia Systems and Applications 19 Gamma in Video camera transmission display • Camera performs gamma correction • Display imposes inverse power function y = x 1/0.45 Multimedia Systems and Applications 20 Color Image Coding • Image represented by 24 bit pixel (8 bpp) Each color value between 0 and 255 • Video uses non-linear coding Uniform distribution of colors to codes RGB R’G’B’ (gamma corrected RGB) • Video uses luminance/chromance R’G’B’ Y’CBCR Luminance is Y (technically luma is Y’) Chromance is CBCR Multimedia Systems and Applications 21 Jargon • CBCR actually color difference signals CB is scaled version of (Y’-B’) CR is scaled version of (Y’-R’) • Confusing terms: YUV, YIQ, Y CBCR,… Scaled versions of <Y’, Y’-B’, Y’-R’> Green has highest contribution to luminance • Luminance –vs- Luma (i.e., Y –vs- Y’) Y is linear luminance Y’ is gamma corrected luminance (aka luma) Multimedia Systems and Applications 22 Chroma Subsampling • Chroma subsampling reduces data 2 chroma/luma 16 bpp 1 chroma/luma 12 bpp • Notation is a:b:c a is luma samples b is chroma samples per odd line c is chroma samples per even line • Ex. 4:4:4, 4:2:2, 4:1:1, 4:2:0, … Multimedia Systems and Applications 23 Line Sampling Y Y Y Y Y 4:4:4 CR/CB CR/CB CR/CB CR/CB CR/CB Y Y Y Y Y 4:2:2 CR/CB Y Y CR/CB Y Y CR/CB Y 4:1:1 CR/CB 4:2:2 is referred to as broadcast quality 4:1:1 is referred to as VHS quality 4:2:0 is 2:1 down sampling in horizontal and vertical direction Multimedia Systems and Applications 24 4:2:0 Sampling Luma sample Chroma sample Multimedia Systems and Applications 25 Representations • Composite NTSC - 6MHz (4.2MHz video), 29.97 frames/second PAL - 6-8MHz (4.2-6MHz video), 50 frames/second • Component Separation video (luma, chroma) - svhs, Hi8mm RGB, YUV, YIQ, … YCBCR - used for most compressed representations • Separation video called “s-video” Multimedia Systems and Applications 26 Analog Video Representations • NTSC Y = 0.299R + 0.587G + 0.114B I = 0.596R - 0.275G - 0.321B Q = 0.212R - 0.523G + 0.311B composite = Y + Icos(Fsc t) + Qsin(Fsc t) • PAL Y = 0.299R + 0.587G + 0.114B U = 0.492(B-Y) Q = 0.877(R-Y) composite = Y + Usin(Fsc t) + Vcos(Fsc t) Multimedia Systems and Applications 27 Digitizing • Analog TV is a continuous signal • Digital TV uses discrete numeric values Signal is sampled Samples are quantized Small, discrete regions are digitized • Image represented by pixel array Multimedia Systems and Applications 28 Image Sizes 160 352 120 720 800 1152 1280 1920 QSIF (19Kp) SIF (82Kp) 240 601 (300Kp) 483 SVGA (500Kp) 600 ATV (1Mp) 720 900 Workstation (1Mp) HDTV (2Mp) 1080 Multimedia Systems and Applications 29 Workstation Images? • [1 Mega Pixel] 1152 x 900 = 1,036,800 pixels • [xvga] 1024 x 768 • [?] 1280 x 1024 Multimedia Systems and Applications = 786,432 pixels = 1,310,720 pixels 30 Digital Video Representations • Digital Composite Video(D2/D3,SMPTE 244M) 142 Mb/s data rate, either parallel or serial Subsampled color signals 4:2:2 • Digital Component Video(D1/D5,SMPTE RP125) Maintain separate signals for luma and chroma 270 Mb/s data rate, either parallel or serial Subsampled color signals 4:2:2 • Compressed Digital Video MPEG, MJPEG, H.26x, DV, … Multimedia Systems and Applications 31 Digital Video Block Structure macroblock • 4:2:2 YCBCR Y1 Y2 CB1 CR1 16x16 macroblock Y3 Y4 CB2 CR2 8x8 pixel blocks 8 bits/sample = 16 bits/pixel = 4Kbits/macroblock • 4:1:1 YCBCR 3Kbits/macroblock 12 bits/pixel Multimedia Systems and Applications Y1 Y2 Y3 Y4 CB CR 32 What is Video Data Rate? • Digital 720x483 = 347,760 pixels/frame 4:2:2 sampling gives 695,520 bytes/frame 21 MB/sec (167 Mbs) 4:4:4 sampling gives 250 Mbs • ATV (MPEG MP@ML) 1280x720 = 921,600 pixels/frame 4:2:0 sampling gives 1,382,400 bytes/frame 41 MB/sec (328 Mbs) (Note: MPEG coded streams are 1.5-80 Mbs) Multimedia Systems and Applications 33 What is Video Data Rate (cont.)? • ATSC (720P) 720x1280 = 921,600 pixels per frame 4:2:2 sampling = 1,843,200 bytes per frame 24 fps = 44,236,800 bytes per second 44 MB/s = 354 Mbs • ATSC (studio 1080I) 1080x1920 = 2,073,600 pixels per frame 4:4:4 sampling = 6,220,800 bytes per frame 30 fps = 186,624,000 bytes per second 187MB/s = 1.5 Gbs Multimedia Systems and Applications 34 Serial Digital Data Rates • Serial digital is a video transport standard Widely used in broadcast and production studios • Data rates NTSC SDI approximately 200 Mbs ATSC SDI approximately 1.5 Gbs • Routing Switcher Remember typical facility might have a 256x256 routing switcher What is data rate across the switch? Multimedia Systems and Applications 35 Human Perception •What is smooth motion Depends on source material Most action is perceived as smooth at 24 fps •Human most sensitive Low frequencies Changes in luminance and blue-orange axis •Vision emphasizes edge detection Strong bias to horizontal and verticle lines •Visual masking by large luminance changes Multimedia Systems and Applications 36 Producing High Quality Video • Need high quality camera S-Video(SVHS, Hi8mm) better than composite 3 chips better than 1 chip Digital better than analog • Lights, lights, lights… Experiment with filters to change apparent colors • Shoot scene from different angles and cut between them to create visual stimulation • Study film/video techniques Let person exit the scene without moving camera Keep orientation of images correct Change scene/shot to reflect time change Multimedia Systems and Applications 37 Conclusions • NTSC/PAL were excellent standards that lasted over 50 years Technology has changed dramatically during this time • Digital revolution is changing industry Main impact is development of ATSC standard • Internet revolution is also chaning industry Webcasting –vs- mass market broadcasting Wireless –vs- cable –vs- packet transmission Multimedia Systems and Applications 38