Digital Media Dr. Jim Rowan ITEC 2110 Video

advertisement
Digital Media
Dr. Jim Rowan
ITEC 2110
Video
Video
• Works because of persistence of vision
• http://en.wikipedia.org/wiki/Frame_rate
• Fusion frequency
– ~ 40 frames per second
– depends on the brightness of the image relative to
the viewing environment
• Less than that
– flickering
– individual images appear losing the illusion of
motion
Video
• Video vs Animation...
– Video - capture of frames and then
playback
– Animation - create frames individually and
then playback
Video...
Computationally demanding
– Capture must be fast enough to capture sufficient
frames to produce the illusion of motion
– Transport (if across the web) must be fast enough
to carry those captured frames at a rate fast
enough to produce the illusion of motion
– Playback must be fast enough to play those
captured frames at a rate fast enough to produce
the illusion of motion
Video
• If the transport and playback is not fast
enough, frames will be dropped
• Video players (like quicktime) make
compromises differently
– Trying to “degrade gracefully”
• Some drop frames holding the last image
– effectively losing the illusion of motion but continuing the
story as a slide show
• Some play lower resolution images
• Some continue to play audio
Video Standards
• NTSC
– America and Japan
– http://en.wikipedia.org/wiki/Ntsc
– North America and Japan
– 24 frames per second
– framesize different than PAL
Video Standards
• PAL
– Western Europe and Australia
– http://en.wikipedia.org/wiki/PAL
– Western Europe and Australia
– 25 frames per second
– framesize different than NTSC
Video gets big fast
• At a 640 X 480 framesize
• Using 3 byte color (24 bits, one byte per
color) each frame ~ 2 megabytes
• One second of video (uncompresssed)
is 26 Megabytes
• One minute is 1.85 gigabytes
The effects of large size...
• Uncompressed this exceeds most home
computer interface standards
• Strains the internal speed of the home
computer
• Strains the storage capability of home
computer
• WAY exceeds what can be carried by
the net
What to do?
Apply compression!
• On the capture side
– digitization & compression needs to be
carried out by the hardware to be fast
enough
– Can be done in the camera (VTR)
– Can be done in the computer (iSight cam)
Compression in the VTR
• Within the camera
– 3 (at least) different formats internally
• Differing error correction and compatibility
– Recording on different media
• CD
• tape
• Mini DV or DV format
– Connected to computer using firewire
– All 3 formats present the same stream of bits to
the computer
• Artifacts that interfere with processing and
recompression are created
Compression in the computer
• Analog is presented to the computer
through a video capture card
• Compression is done (usually) in the
video capture card
• Allows for a really small camera
because the work is done elsewhere
Analog vs Digital
• An analog signal to the computer is
susceptible to noise corruption
• Digital signal is not
• What’s the big deal?
• Consider compressing a video of a wall
painted a solid color
– Analog noise will cause small fluctuations from
pixel to pixel
– RLE can’t compress it because each pixel is a bit
different
Consider compressing
this using RLE
analog signal
!!!NOISE!!!
computer
iSight
Camera
video capture card
compression
digital
signal
miniDV
the scene
iMovie
compression
640 x 480 = 307,200
307,200 can be represented
by < 24 bits, call it 3 bytes
RLE:
307,200 (3bytes) +
RGB (3 bytes) = 6 bytes
640 x 480 = 307,200 bytes
Noise makes each pixel
a little different
RLE:
307,200 bytes x RGB (3bytes)
= 921600 bytes
hardware vs software
compression
• Hardware conversion... user has no control
over it... it is hardwired
– It is in the camera
– It can be in the video card
• Software conversion... is computationally
expensive... it’s a slow process
– Provides for the most flexibility
– Can use different software coder-decoders
(codec), picking and choosing what fits your needs
better
Streaming Video
http://en.wikipedia.org/wiki/Streaming_media
Video is
transported across the web
played as it arrives
Similar to broadcast TV
Can be part of video conferencing
Network bandwidth is the enemy
3 Methods of Video delivery
• True Streaming video
• Embedded video
• Progressive download or HTTP
streaming
True Streaming video
Network programs are streamed
• never stored on disk
• can be open-ended
• you don’t get a copy to play later
– solves copyright issue!
• can collect viewer stats without Nielsen ratings
• advertisers know how many viewers there are
• on some, you can advance ahead of where the
download is complete and the play will start again at
the new spot
Embedded video
• entire file is transferred before play begins
• stored on disk
• may require more disk than is available
Progressive download
or
HTTP streaming
• downloads some and then starts playing
• predicts when the download will complete and starts
playing when there is enough downloaded to safely
play without interruption
• you get a copy to play later
• takes up disk storage
• quicktime does this
• on some, you can advance ahead of where the
download is complete and the download will start
again at the new spot
The TV legacy
• Raster scan - 525 lines in US and Japan
• Not enough bandwidth to transmit the
whole frame at a rate of 40+ frames per
second
• Resulted in Interlacing fields
• Play half the Frame (and half the data)
then play the other half of the Frame
525 lines total
525 lines total
525 lines total
The problem...
• The fields were played one after the
other to avoid flicker
• BUT...
• The fields were also captured one after
the other...
• there was a time difference between
when they were captured
Why is this a problem?
• When you play them on a computer, the
computer can refresh much faster and can
display the entire frame at the same time
• To play it can put both fields in a frame buffer
and displays them at the same time
• If the object is moving fast, the second frame
shows the object in a different place
• Results in a “comb effect”
Can we solve the problem?
• You can average the two frames and
construct a single frame
• You can toss out one of the fields and
interpolate between them
• Neither is very good...
Film to video?
• Problematic
– video is 30 frames per second
– film is 24 frames per second
• How do you make 30 frames from 24?
• The 3-2 pull down
– http://en.wikipedia.org/wiki/3-2_pulldown
– 3 film frames occupy the first 3 fields
– 2 film frames occupy the next 2 fields
Next
• DV and MPEG
Questions?
Download