Video

advertisement
Video Processing
CSC361/661 -- Digital Media
Spring 2004
Burg/Wong
1
What’s the difference between
analog and digital video?


The way the signal is sent
Every pixel element and audio sample is
a stream of 0’s and 1’s
2
What’s so good about
digital video?









Less subject to noise.
Can be stored and transmitted on a computer
Digital editing techniques are powerful and also allow
the video to be integrated into othere multimedia
applications.
Editing can be non-linear and non-destructive.
Interactive elements can be added during editing.
Can be reproduced repeatedly without degradation of
quality.
Can be easily compressed and encrypted.
Can be replayed non-linearly and in still images.
One digital video file can be replayed with different
settings depending on the system where it is
replayed (e.g. resolution, frame rate, color).
3
Organization of Video Signals




Component video – 3 wires/connectors
connecting the camera or other devices to a
TV or monitor; three separate signals for the
image components
YUV or YIQ works well for analog video
Advantage: The 3 components of the signal
are on separate lines, so there isn’t any
electromagnetic interference among them.
Disadvantage: Requires more bandwidth and
synchronization.
4
Organization of Video Signals




Composite video – all the image
information (e.g., YUV) sent on one line or
channel.
Luminance and chrominance components are
separated on the receiver end.
Connecting a TV with a VCR can be done this
way – one connection (audio signal
connected separately.)
There can be some interference among YUV
components.
5
Organization of Video Signals




S-video – a compromise between
component and composite video.
Uses two wires or channels – one for
luminance and one for chrominance.
Not as expensive as component video.
Less interference between the two compared
to composite signal.
6
Scanning Methods




Interlaced – odd lines are displayed first,
then even lines
Taken together, all the odd lines are called a
field (and similarly for all the even lines).
The original purpose of interlaced display was
to avoid flicker.
Standard television uses this method,
displaying 60 fields per second (which makes
30 frames per second).
7
Scanning Methods




Progressive – the method used by
computer monitors.
An entire screen can be written to a buffer.
The buffer is displayed “instantaneously.”
Think about how analog video would be
converted to digital – the fields would have to
be put back together, which can create
interlacing artifacts.
8
Standards Organizations for
Video (originallly analog,
extended to digital)



NTSC
 National Television Systems Committee
 North America, Japan, Taiwan, and parts of the Caribbean
and South America
 525 scan lines, 29.97 frames/s/ 4:3 aspect ratio, YIQ color
model, interlaced fields
PAL
 Phase Alternating Line
 France, Australia, New Zealand
 625 scan lines per frame, 25 frames/s, 4:3 aspect, ratio,
YUV color model, interlaced fields
SECAM
 Système Electronique Couleur avec Mémoire
 Soviet Union and Eastern Europe
 625 scan lines per frame, 25 frames/s, 4:3 aspect, ratio,
YUV color model, interlaced fields (differs from PAL in the
9
color coding scheme
Analog Signal for Video
Transmission (NTSC)
525 lines/frame * 29.97 frames/s ≈
15,734 lines/s
Each line must be “swept out” in
1/15,374 secs ≈ 63.6 μsec.
The horizontal retrace signal takes 10.9
μsec.
This leaves 52.7 μsec for the active line
signal giving image data.
10
Analog Video Signal
11
Converting Analog Video to
Digital





CCIR 601 (one standard for digital video) specifies a
standard that applies to both NTSC and PAL.
According to the CCIR standard, a frame is sampled
to 720 X 480 pixels for NTSC and 720 X 576 for PAL.
But this is misleading. There aren’t really 720 pixels
per line. The number of samples taken to digitize the
video doesn’t necessarily correspond to the number
of pixels on the display device.
You can do digital video in NTSC format at 640 X 480
or 720 X 480 with different pixel aspect ratios.
You can do digital video in PAL format at 720 X 480
or 720 X 576 with different pixel aspect ratios.
12
Subsampling


CCIR 601 prescribes 4:2:2 subsampling
of the chrominance component.
This means that there in every 4-pixelsquare area, 4 luminance samples are
taken and 2 of each of the chrominance
samples are taken (4 Y’ samples, 2 CB
samples, and 2 CR samples).
13
14
Digital Television, SDTV, HDTV

SDTV vs. HDTV



Is HDTV the same thing as digital TV?


Standard definition television
High definition television
no
Characteristics of HDTV

16:9 aspect ratio (1280 X 720 or 1920 X 1080)
and Dolby digital surround sound (AC-3)
15
HDTV vs. DTV





High definition television is not necessarily digital –
that is, it does not have to be digitally transmitted.
What characterizes HDTV is the aspect ratio,
resolution, and sound quality
Digital television is not necessarily HDTV.
What characterizes DTV is the way in which the data
is transmitted – in digital, as opposed to analog,
form.
HDTV was not originally DTV, but at present most
HDTV is digitally transmitted.
16
Digital Television





There are 18 different DTV formats
Six are also HDTV.
Five of these (the DTV formats that are also
HDTV) are based on progressive scanning
and one on interlaced.
Both HDTV and DTV use MPEG-2
Three of the 18 formats for DTV that are
used frequently are:



480p – 640 X 480 pixels, progressive
720p – 1280 X 720 pixels, progressive
1080i – 1920 X 1080 pixels, interlaced
17
Digitizing Video

One of the biggest considerations, of course,
is file size.

Size of file =
frame rate * frame width * frame height * bytes per
pixel * number of seconds
For example:
30 f/s * 640 pix * 480 pix * 3 bytes/pix * 60 s
= 1,658,000,000 bytes= ~ 1.6 GB

18
Where does file size create
challenges?




In capturing digital video, your hardware and
software has to be able to keep up with the data
rate.
When the file is stored, you have to have enough
room on your hard disk.
When the file is downloaded and then played, your
user has to have the patience to wait for the
download.
When digital video is played in real-time, the data
transmission rate has to be fast enough to keep up
with the rate at which the video should be played.
19
Ways to capture digital video



Copy something that is already in video
format (either digital or analog) to your
computer.
Directly, live, from a digital camera,
either analog or video
Pick up a live video broadcast signal on
your computer.
20
What equipment do you need to
do this?




If source is already digital video, it can be
transmitted directly to computer.
If the source is recorded analog video, a video
capture card must convert analog to digital.
Digital camera may digitize and compress before the
data is sent to the computer.
Connect camera through high speed Firewire (IEEE
1394 interface) or USB.(USB can handle data transfer
rates of 1.5 to 480 Mb/s. Firewire can handle up to
800 Mb/s.)
21
Advantages and disadvantages of
digitizing in the camera

Advantage – less noise from transmission.



Noise degrades the quality.
Noise makes compression more difficult.
Disdvantage – you have to use some
standard format and don’t have as much
control over compression and data rate.


Noise degrades the quality.
Noise makes compression more difficult.
22
Codec


Compressor/decompressor
Hardware or software?

Both


Hardware on a video capture board
Software – compression done within your video
processing program. Decompression done at
the user’s computer.
23
Where and when to compress

In the digital camera, after which the digital
stream is passed to the computer through
FireWire (IEEE 1394). DV is standard.




DV data rate is ~3.5 MB/s, 720X480 NTSC
In the video capture card on the computer,
where the video is converted from analog to
digital and compressed using the card’s
hardware.
In a software codec.
Compression is usually done twice – once
during capture, and again as the video is
prepared for distribution.
24
Compression Strategies

Spatial compression (intra-frame)


Temporal compression (inter-frame)




Areas that are alike can be grouped in a more
concise representation.
Record the difference between one frame and
another. Look up tables
Keep a table of typical patterns
Record a small part of the image as an entry into
the table
Downsampling (luminance/chrominance)
25
Examples of Codecs





Cinepak
Intel Indeo
Sorenson
MPEG
Even with these codecs, you generally can’t
compress video enough so that it can be
played in full screen on a mid-range
computer. You can get about 320X240 at 12
frames per second
26
Cinepak




Gets a good compression rate
Decompresses a lot faster than it
compresses
Uses vector quantization and temporal
compression with key frames and
difference frames
Good for video with a lot of motion
27
Intel Indeo



About 30% faster than Cinepak at
compresses
Preserves color well on video with a lot
of static scenes
Uses vector quantization and temporal
compression with key frames and
difference frames
28
Sorenson





Available as part of QuickTime
Newer than the other two. Good quality,
good compression rate
Can compress to a data rate of 50 KB/s
Uses vector quantization and temporal
compression with key frames and difference
frames
Sorenson’s motion compensation method is
similar to the one used in MPEG compression
29
MPEG




An international standard
Motion Picture Experts Group
MPEG-1, 2, 3, and 4. As the numbers get
higher, you get more compression, so the
compression method is suitable for more
“challenging” material
The standard specifies how the data stream is
formatted after compression and how it will
be decompressed, but not how the
compression has to be implemented
30
MPEG-1






Released in 1992
Designed for audio/video played mainly from CDROMS and hard drives
Compression ratio of about 4:1; depends on
application
Typical data rate of 1.86 Mb/s -- for video that can
be stored on CD, VHS quality
Progressive scan (can’t handle interlacing or HDTV
formats)
Typically 320X240 (square pixel format) or 352X240
(SIF – Source Input Format), 29.97 frames per
second
31
Terminology for MPEG
Compression





Block is 8 X 8 pixels
Macroblock is 16 X 16 pixels
Progressive scanning displays line after
line
Interleaved scanning displays odd
numbered lines, then even
A field is either all the odd numbered
lines or all the even numbered lines
32
MPEG Compression



I, P, and B frames are designated
A GOP (group of pictures) size is chosen
GOP size is usually about 8, 12, or 16
frames
33
Steps in MPEG compression

I frames are compressed like static images
are with JPEG compression







4:2:0 or 4:2:2 downsampling
DCT
Quantization
Run-length and Huffman encoding
Need about 2 I frames/s for random access
Each P frame is encoded with reference to
the previous I or P frame
Each B frame is encoded with reference to
previous or subsequent frames
34
Discrete Cosine Transform

DCT for an N X N pixel image p xy ,0  x  N ,0  y  N
the DCT is an array of coefficients:
DCTuv ,0  u  N ,0  v  N 
1
N 1
DCTuv 
Cu Cv  x 0
2N

 (2 x  1)u 
 (2 y  1)v 
 y 0 pxy cos 2 N  cos 2 N 
N 1
where
1
Cu , C v 
for u , v  0
2
Cu , Cv  1 otherwise
35
So what is MP-3?

MP3 is audio encoding in the standard of
MPEG1 audio Layer 3. There are three audio
“layers” possible in MPEG encoding




Layer 1 32-448 Kb/s
Layer 2 8-384 Kb/s
Layer 3 8-320 Kb/s
All three layers use psychoacoustical
encoding methods

If one frequency component is going to be
masked by another one, it doesn’t matter if you
drop it
36
MPEG-2









Released in 1994
Intended as a coding standard for SDTV and HDTV
with data rates of 1.5-60 Mb/s; 15 Mb/s is typical
(“Main Profile at Main Level” – MP@ML)
Scalable, as compared to MPEG-1
Profiles describe functionality, levels describe
resolution
Broadcast quality
Supports interlaced video
720X480 frame
4:2:2 or 4:2:0 downsampling
About 8:1 compression ratio; depends on application
37
MPEG-3




Originally intended for HDTV
MPEG-2 was sufficient for HDTV
Projected 12:1 compression ratio
Not really used much
38
MPEG-4



Standardized in 1998, but still under
development
Can mix video with text, graphics, and
2-D and 3-D animation layers
5 Kb/s to 4 MB/s
39
Maximum Data Transfer Rates



POTS
28.8-56 Kb/s
ISDN (Integrated Services Digital Network)
64-128 Kb/s
ADSL (Digital Subscriber Line) 1.5-8.5 Mb/s
(downstream)


VDSL
CATV
16-640 Kb/s (upstream)
12.96-55.2 Mb/s
20-40 Mb/s
40
Speed of Delivery from
Storage Devices

CD





150 KB/s
300 KB/s
1200 KB/s
7.8 MB/s
1x
16x
1.35 MB/s
21.6 MB/s
DVD



1x
2x
8x
52x
SCSI Hard Drive

10-20 MB/s (or even as high as 100 MB/s)
41
Storage Capacity of Storage
Devices

CD


700 MB
DVD

4.7 GB
42
Download