Course Outline

advertisement
CS-565 Computer Vision
FALL – 2015
CS-565 – Computer Vision
Instructor: Dr. Nazar Khan
Semester: Fall 2015
Campus: Allama Iqbal
Passing this course is necessary for students planning to register for an
M.Phil Thesis with Dr. Nazar Khan.
Course Description:
Human beings (and even animals) “look” at the real-world and extract extremely accurate
information extremely efficiently. Computers can fail catastrophically at this task! In this course
we look into why “Vision” is a difficult problem to solve and we go through successful,
mathematically well-founded techniques used to solve the Vision problem.
This course is a useful application of mathematical concepts from Linear Algebra and Calculus.
Therefore, the students could do well by brushing up on their Linear Algebra, Calculus and
programming skills before taking this class. The techniques learned here can be useful for other
areas such as Image Processing, Machine Learning, Artificial Intelligence and Computer
Graphics.
Who can register?
This course is open to graduate students as CS-565. It is also open to undergraduate students
as SE-461.
Class size will NOT exceed 40 students. Graduate students have top priority. Remaining seats
will be filled by undergraduates based on
1. CGPA
2. FYP topic
3. Instructor's permission (only in exceptional circumstances).
To determine course grade, graduate students will be evaluated in a more rigorous manner.
CS-565 Computer Vision
FALL – 2015
Goals and Objectives:
1. General overview of the field of Image Analysis.
2. Learn fundamental techniques and mathematical models in the areas of Image Processing
and Computer Vision.
3. Understand typical Computer Vision pipelines for solving problems.
By the end of the semester, a student should:
1.
2.
3.
4.
5.
6.
7.
not be afraid of Mathematics,
be able to develop Computer Vision applications,
explain fundamental algorithms in the area along-with their mathematical subtleties,
appreciate that Computer Vision is a deceptively hard problem,
be equipped to read research papers in the area,
report results in a scientifically satisfactory manner, and
be comfortable enough with the terminology and techniques so as to pursue graduate
level research in the area.
Related Text / Reading Material:
We will not be following one particular reference. However, relevant material can be found in:
Image Processing
Digital Image Processing by R. C. Gonzalez, R. E. Woods.
Computer Vision
Introductory techniques for 3D Computer Vision by Trucco and Verri
Image Processing, Analysis, and Machine Vision by Milan Sonka, Vaclav Hlavac, Boyle Roger
Computer Vision: Algorithms and Applications (http://szeliski.org/Book/)
Multiple View Geometry in Computer Vision by Hartly and Zisserman
Computer Vision: A Modern Approach by Forsyth and Ponce
Computer Vision by Linda G. Shapiro, George C. Stockman
CS-565 Computer Vision
Scheme of Study:
1. Introduction (1 Week)
a. Computer Vision vs. Image Processing vs. Computer Graphics
b. Computer Vision vs. Biological Vision – The Grand Deception!
c. Successful Computer Vision solutions.
2. Mathematical Background (1 Week)
a. Cartesian vs. Image axis
b. Taylor's formula
c. Constrained optimisation
d. SVD
3. Image Processing (4 Weeks)
a. Image Types and their degradations.
b. Color Perception and Color Spaces
c. Filtering and Convolutions
i. Gaussian Smoothing
ii. Derivative Filtering
d. Fourier Transform
i. Basic idea and background math
ii. Euler's formula for circular motion
iii. Projection onto circular motion basis
iv. Frequency domain filtering
e. Image Adjustment
4. 2D Computer Vision (5 Weeks)
a. Edge Detection
b. Corner Detection
c. Hough Transform
d. Spatial Transformations
i. Types of spatial transformations
ii. Recovering best transformation
iii. Image warping
e. Optical Flow
i. Local Methods
ii. Global Methods
5. 3D Computer Vision (3 Weeks)
a. Projective Geometry
i. Camera Models
ii. Camera Matrix Anatomy
iii. Camera Calibration
FALL – 2015
CS-565 Computer Vision
FALL – 2015
b. Stereo Reconstruction
i. The case of orthoparallel cameras
ii. The case of converging cameras
1. Epipolar constraint and fundamental matrix
2. Estimation of fundamental matrix
3. Disparity estimation
6. Machine Learning for Computer Vision (1 Week)
a. Principal Component Analysis
Grading Scheme/Criteria:
Assignments
Quizzes
Mid
Final
20%
5%
35%
40%
Grading Policy:
1. To determine course grade, graduate students will be evaluated in a more rigorous
manner.
2. Theoretical assignments have to be submitted before the lecture on the due date.
3. There will be no make-up for any missed quiz.
4. Make-up for a mid-term or final exam will be allowed only under exceptional
circumstances provided that the instructor has been notified beforehand.
5. Instructor reserves the right to deny requests for any make-up quiz or exam.
6. Worst score on quizzes will be dropped.
7. Worst score on assignments will be dropped.
Download