DIP_overview

advertisement
Why D.I.P.?
Reasons for compression
– Image data need to be accessed at a different time or
location
– Limited storage space and transmission bandwidth
Reasons for manipulation
– Image data might experience nonideal acquisition,
transmission or display (e.g., restoration, enhancement and
interpolation)
– Image data might contain sensitive content (e.g., fight
against piracy, conterfeit and forgery)
– To produce images with artistic effect (e.g., pointellism)
Reasons for analysis
– Image data need to be analyzed automatically in order to
reduce the burden of human operators
– To teach a computer to “see” in A.I. tasks
Beyond Image Processing
The way of thinking
– From art (heuristics) to science (principles)
– The key is mathematics (I will write a separate
blog about the role of mathematics in DIP)
The holistic view
– Everything is connected (recall the six-degree
phenomenon in social science)
The “Google”-style re-search
– Ability to search is a basic part of learning
Image
Acquisition
D.I.P.
Theme
Park
Image
Generation
Image
Compression
Image
Manipulation
Image
Display
Image
Analysis
Image
Perception
DIP is also about connecting dots – in image compression, you
will see why you need to learn matrix theory and statistics
The Art of Image Compression
Why are images compressible?
– Redundancy in images (NOT random)
How data compression works?
– Probability theory and statistics
– Shannon’s information theory
What about the future of image
compression?
– I will discuss this in my weblog and facebook
(Google “what happened to Iterated Systems Incorporated?”)
Shannon’s Picture on
Communication (1948)
source
channel
encoder
channel
channel
decoder
destination
super-channel
source
encoder
source
decoder
The goal of communication is to move information
from here to there and from now to then
Examples of source:
Human speeches, photos, text messages, computer programs …
Examples of channel:
storage media, telephone lines, wireless transmission …
Lossless vs. Lossy Compression
Lossless: zero error tolerance
– No information loss
– Shannon’s entropy formula
– For photographic images, compression ratio is
modest (about 2:1)
Lossy: the goal is to preserve the visual quality of
images
– Information loss visually acceptable
– Shannon’s rate-distortion function
– For photographic images, compression ratio is
typically around 10-100
Popular Lossless Image
Compression Techniques
 WinZip
- Based on the celebrated Lempel-Ziv algorithm
invented nearly 30 years ago
 GIF (Graphic Interchange Format)
-Based on an enhanced version of LZ algorithm
by Welch in 1983
-Was introduced by CompuServe in 1987 and made
popular until it was not royalty-free in 1994
 PNG (Portable Network Graphics)
GIF Liberation Day: June 20, 2003
Lossy Image Compression
compressed JPEG file
(20,407 bytes)
100
Q
JPEG
decoder
Q
low compression ratio
decompressed image
high quality
high compression ratio
0
low quality
original raw image
(262,144 bytes)
From JPEG to JPEG2000
discrete cosine transform based
JPEG (CR=64)
wavelet transform based
JPEG2000 (CR=64)
Image
Acquisition
D.I.P.
Theme
Park
Image
Generation
Image
Compression
Image
Manipulation
Image
Display
Image
Analysis
Image
Perception
DIP is also about connecting dots – in image manipulation, you
will see why you need to learn calculus and Fourier transform
Image Manipulation (I):
Noise Removal
Noise contamination is often inevitable during the acquisition
salt and pepper (impulse) noise additive white Gaussian noise
You will learn how to design image filter in a principled way
Image Manipulation (II):
Deblurring
License plate is barely legible due to motion blurring
You will learn the use of FT and the necessity of regularization
Image Manipulation (III):
Contrast Enhancement
under-exposed image
overly-exposed image
You will learn how to modify the histogram of an image
Image Manipulation (IV):
Aliasing Reduction
Example: aliasing artifacts in MRI image acquisition
Ideal quality,
slow scanning
nonideal quality,
fast scanning
Tradeoff between scanning time and image quality
(image reconstruction is covered by EE425)
Image Manipulation (V):
Image Interpolation
digital
zooming
small
1M pixels
large
4M pixels
Resolution enhancement can be obtained by common image
processing software such as Photoshop or Paint Shop Pro
You will learn the difference between digital and optical zooming
Image Manipulation (VI):
Image Mosaicing
Merge multiple images of the same scene into one with larger FOV
+
=
There exist several mosaicing software for automatic stitching
F.Y.I.: search “Gigapixel images” by Google
http://triton.tpd.tno.nl/gigazoom/Delft2.htm
Image Manipulation (VII):
Error Concealment
blocks contaminated by channel errors
(this problem is covered in EE565)
Image Manipulation (VIII):
Deblocking/Deringing
Block artifacts
Ringing artifacts
You will learn how to suppress those artifacts by nonlinear diffusion
Image Manipulation (IX):
Dejittering
jittering noise (you will see it in either
bonus assignment or final project)
Image Manipulation (X):
Image Inpainting
You will learn how to do image inpainting in EE565
Image Inpainting Application:
Restore Old Photos
Image Manipulation (XI):
Color Quantization
25,680 colors (24 bits)
256 colors (8 bits)
Applications: video cell-phone, gameboy, portable DVD
Image Manipulation (XII):
Image Halftoning
grayscale: 0-255
halftoned: 0/255
You will learn the famous Floyd-Steinberg diffusion in CA
Image Manipulation (XIII):
Image Scrambling/Hashing*
original
scrambled
Image Manipulation (XIV):
Image Watermarking*
Original image
Modified image
Image Manipulation (XV):
Image Stylization*
Image Manipulation (XVI):
Image Rendering*
computer
generated
Abyss
Image-based Rendering*
Image
Acquisition
D.I.P.
Theme
Park
Image
Generation
Image
Compression
Image
Manipulation
Image
Display
Image
Analysis
Image
Perception
DIP is also about connecting dots – in image analysis, you
will see why you need to know about neuroscience and psychology
Image Analysis (I): Edge Detection
You will learn basic edge detectors based on derivatives
Image Analysis (II): Face Detection
Deceivingly simple for humans but notoriously difficult for machines
Image Analysis (III): Change Detection
Change Detection in Medical Application
Image Analysis (IV): Image Matching
Antemortem dental X-ray record
Postmortem dental X-ray record
Image Matching in Biometrics
Two deceivingly similar fingerprints of two different people
Image Analysis (V):
Image Segmentation
Image Analysis (VI): Object Recognition
License number can be automatically
extracted from the image of license plate
Object Recognition in Military
Applications
Image Analysis (VII): Event Recognition
Image-based monitoring system prevents drowning
Image Analysis (VIII):
Video Summarization
Only send out “important” motion pictures such as home-runs
Image Analysis (IX):
Content-based Image Retrieval
retrieved building images
Summary
In EE465, you will learn
– Image compression: Lempel-Ziv, Huffman coding, run-length
coding and JPEG
– Image manipulation: linear/nonlinear filtering, histogram-based
processing, linear interpolation
– Image analysis: edge/corner detection, circle/ line/ellipse
detection, chain codes/shape numbers
In EE565, you will learn
– Advanced algorithms/techniques with stronger mathematical
emphasis
Not covered by the courses I offered
– CBIR (multimedia database), face/pedestrian detection
(Advanced biometrics), PDE-based image processing (medical
image analysis)
General Cooking Recipe
Motivations
Problem statement
Heuristic observations
Examples/Illustrations
Principled approaches (via mathematics)
MATLAB implementations
Computer assignment reviews
Current state-of-the-art and future
directions
Download