CS576 Computer Vision Instructor: Dr. Yu-Wing Tai No official TA Tuesday and Thursday 4:00pm – 5:30pm Rm 3444 E3-1 Building 1 Who am I ? Course Webpage: http://yuwing.kaist.ac.kr/courses/CS576/index.html Instructor Info: Dr. Yu-Wing Tai yuwing@cs.kaist.ac.kr / yuwing@gmail.com Education: PhD National University of Singapore 2009 M.Phil Hong Kong University of Science and Technology 2005 B.Eng Hong Kong University of Science and Technology 2003 Research: Computer Vision, Image/Video Processing Web: http://yuwing.kaist.ac.kr/ Office hours: arrange by email. 2 Grading (Absolute Grading) • You start with the letter grade “C-”, you earn sub-grade from: – – – – – – 2 course projects (2 sub-grades) 1 term project (1-2 sub-grades) Peer review of term project (1 sub-grade) 1 oral examination (1 sub-grade) Bonus (See project description) Attendant (At least 60%, 1 sub-grade) • Best Grade: A+ (No limit on number of A+) • Worst Grade: D+ (No exception for better grade) 3 Project 1: Feature Detection and Matching • Descriptions: – http://yuwing.kaist.ac.kr/courses/CS576/project1/index.html • Deadline: Friday mid-night (00:00) on Week 5 • A project directly copy from the first project of the computer vision class in U. Washington • 1 sub-grade for this project; 0.5 sub-grade for unsuccessful implementation; 0 sub-grade for late submission • No bonus sub-grade for Project 1 • Goal: – Fundamental project for computer vision class – Learning how to use skeleton codes from others – Learning how to find related internet resources for project 4 Project 2: Paper implementation • Descriptions: – http://yuwing.kaist.ac.kr/courses/CS576/project2/index.html • Deadline: Friday mid-night (00:00) on Week 9 • 1 sub-grade for this project; 0.5 sub-grade for unsuccessful implementation; 0 sub-grade for late submission • Maximum 1 Bonus sub-grade for extra successful submission • Goal: – – – – Paper reading Find your own interests area Learning how to find related resources Learning how to reproduce previous research projects 5 Term Project: A mini-conference submission • Descriptions: – http://yuwing.kaist.ac.kr/courses/CS576/termproject/index.html • Deadline: Friday mid-night (00:00) on Week 13 • 1 sub-grade for submission; 1 sub-grade for acceptance; 0 sub-grade for late submission • Acceptance rate will be about 30% - 50% • 1 sub-grade for peer evaluation • Bonus sub-grade for best presentation • Bonus sub-grade for 2 outstanding reviewers • Bonus sub-grade(s) for extra-acceptance project • Goal: – Understand the process of research cycle and paper submission – Testing your abilities/potentials in doing research/getting PhD 6 Oral Exam • 10-15 minutes face-to-face question and answer session testing your knowledge and your understanding to the term project • Week 15 • 1 sub-grade for passing; 0.5 sub-grade for failure; -1 sub-grade for absent • Goal: – Testing your knowledge – Getting course feedback 7 Course Resources • Books (Available for borrowing): – Computer Vision: Algorithms and Applications © 2010, Richard Szeliski – Computer Vision: A Modern Approach © 2002, David A. Forsyth and Jean Ponce • Computer Vision papers: http://www.gmazars.info/conf/ • Computer Graphics papers: http://kesen.huang.googlepages.com/ • Microsoft Academic Search http://academic.research.microsoft.com/ • Google http://www.google.com/ 8 Course Resources • Computer Vision Source Codes http://www.cs.cmu.edu/~cil/v-source.html • OpenCV http://sourceforge.net/projects/opencvlibrary/ • The Middlebury Computer Vision Pages http://vision.middlebury.edu/ • Computer Vision Algorithm Implementations http://www.cvpapers.com/rr.html • Computer Vision Datasets http://clickdamage.com/sourcecode/cv_datasets.html • Columbia University Computer Vision Lab http://www.cs.columbia.edu/CAVE/ 9 Other Texts • UNDERGRADUATE – A Guided Tour of Computer Vision, by V. S. Nalwa, AddisonWesley, 1993. – Introductory Techniques for 3-D Computer Vision, by Emanuele Trucco, Alessandro Verri, Prentice-Hall, 1998 • GRADUATE/ Specialized Reference – Multiple View Geometry, by Richard Hartley, Andrew Zisserman, Cambridge University Press, 2000. – Numerical Recipes in C, by William Press et al., Cambridge Univ Press, 1992. – Pattern Classification and Scene Analysis, by Richard O. Duda, Peter E. Hart, John Wiley & Sons, 1973. – Convex Optimization, by Stephen Boyd and Lieven Vandenberghe, 2010. 10 What is Computer Vision ? • In 1966, Marvin Minsky at MIT asked his undergraduate student Gerald Jay Sussman to “spend the summer linking a camera to a computer and getting the computer to describe what it saw” • “In the 1960s, almost no one realized that machine vision was difficult.” – David Marr, 1982 • 40+ years later, we are still working on this… 11 What is Computer Vision ? 12 The goal of computer vision • To bridge the gap between pixels and “meaning” 13 What is Computer Vision ? From Wikipedia 14 What is Computer Vision ? Computer Vision Image Processing Image Enhancement, Denoising, etc. Stereo, 3D Reconstruction, etc. Computer Graphics Rendering, 3D Modelling, etc. 15 1970s line labeling intrinsic images pictorial structures stereo correspondence articulated body model optical flow 16 1980s pyramid blending physically-based models shape from shading edge detection regularization-based range data acquisition surface reconstruction and merging 17 1990s factorization-based structure from motion face tracking dense stereo matching image segmentation multi-view reconstruction face recognition 18 2000s image-based rendering image-based modeling texture synthesis Interactive Techniques feature-based recognition region-based recognition 19 State-of-the-art ? http://www.xbox.com/en-US/kinect 20 Why study computer vision ? 21 Why study computer vision ? • Vision is useful • Vision is interesting • Vision is difficult – Half of primate cerebral cortex is devoted to visual processing – Achieving human‐level visual perception is probably “AI‐complete” 22 Challenges: viewpoint variation 23 Challenges: Illumination 24 Challenges: Scale 25 Challenges: Deformation 26 Challenges: Occlusion 27 Challenges: Background cluster 28 Challenges: Motion 29 Challenges: Object intra-class variation 30 Challenges: Local Ambiguities 31 Challenges or opportunities? • Images are confusing, but they also reveal the structure of the world through numerous cues • Our job is to interpret the cues! 32 Computer Vision in the Real World • Special Effects in movie 33 Computer Vision in the Real World • 3D Urban Modeling 34 Computer Vision in the Real World • Microsoft Photosynth (http://labs.live.com/photosynth/) 35 Computer Vision in the Real World • Face Detection 36 Computer Vision in the Real World • Biometrics 37 Computer Vision in the Real World • Optical Character Recognition (OCR) 38 Computer Vision in the Real World • Toys and Robots 39 Computer Vision in the Real World • Games 40 Computer Vision in the Real World • Automotive Safety 41 Computer Vision in the Real World • Vision for robotics, space exploration 42 Goal of this course • Broaden your view about computer vision, but we are not going to study any specific topic in deep • Teaching you how to find computer vision research resources yourself • Understand the research world and teaching you how to be a good computer vision scientist 43 Question ? 44