CS 423 (CS 423/CS 523) Computer Vision Lecture 1 INTRODUCTION TO COMPUTER VISION About the Course 2 Syllabus http://vvgl.ozyegin.edu.tr Objective Introduction to the theory, tools, and algorithms of computer vision Instructor Assist. Prof. M. Furkan Kıraç E-mail: furkan.kirac@ozyegin.edu.tr Room: 219 Hours Mondays, 9:40-12:30, Room: 246 Grading Projects: 4x15% Midterm Exam: 40% 3 Grading Projects: Late submissions are not accepted. Copying answers from others’ work is not permitted. Midterm Exam: At least 3 of the 4 Projects must be turned in by the due date in order to qualify for the Final Exam. No Composite Exam (Bütünleme Sınavı), as there is no final exam. 4 Recommended Books Computer Vision: Algorithms and Applications, Richard Szeliski, Springer, 2010. Computer Vision: A Modern Approach, David A. Forsyth and Jean Ponce, Prentice-Hall, 2002. Introductory Techniques for 3D Computer Vision, Emanuele Trucco and Alessandro Verri, Prentice-Hall 1998. 5 OpenCV Resources OpenCV Computer Vision Application Programming Cookbook Second Editon, Robert Laganière, Packt Publishing, 2014. Learning OpenCV, Gary Bradski and Adrian Kaehler, O'Reilly, 2008. Mastering OpenCV with Practical Computer Vision Projects, Daniel Lélis Baggio, et al., Packt Publishing, 2012. 6 Applications of Computer Vision 7 Image Stitching Image Matching Object Recognition 3D Reconstruction Interior Modeling 12 3D Augmented Reality 13 3D Camera Tracking 14 Stereo Conversion for 3DTV 15 Depth Estimation and View Interpolation for 3DTV 16 Human Tracking 17 License Plate Recognition 18 Human Pose Estimation 19 Course Outline 20 Topics to be covered... Linear Filters, Frequency Domain Filtering, Edge and Boundary Detection Feature Detection Fitting, Alignment Histograms Covariance, Principle Component Analysis (PCA) Face Detection and PCA Optical Flow and Motion Tracking and Mean-Shift Randomized Decision Trees, Pose Estimation Bag of Features Context, Two-View Geometry Summary 21 Relation to Other Fields 22 Computer Vision Figure from "Computer Vision: Algorithms and Applications,” Richard Szeliski, Springer, 2010. 23 Computer Graphics Lights and materials Shading Texture mapping Environment effects Animation 3D scene modeling 3D character modeling (OpenGL) 24 Computer Graphics 25 Image Processing Topics Resampling Enhancement Noise filtering Restoration Reconstruction Segmentation Image compression (MATLAB and OpenCV) 26 Image Processing 27 Video Processing Topics Motion estimation Frame-rate conversion Multi-frame noise filtering Multi-frame restoration Super-resolution Video compression (MATLAB & OpenCV) 28 Video acquisition-display chain Capture Representation Coding Transmission Decoding Rendering 29 Human vs. Computer 30 Optical illusions Actual vs. Perceived Intensity (Mach band effect) 32 Brightness Adaptation of the Eye 33 Optical illusions Optical illusions Why is Computer Vision Difficult? Human perception Human perception Human Visual System 40 Human Eye Photoreceptors: Rods & Cones Rods vs. Cones Rods Perceive brightness only Night vision Cones Perceive color Day vision Red, green, and blue cones Cone Distribution Blue is less-focused 64% 32% 2% Visual Threshold drop during Dark Adaptation Spatial Resolution of the Human Eye Photopic (bright-light) vision: Approximately 7 million cones Concentrated around fovea Scotopic (dim-light) vision Approximately 75-150 million rods Distributed over retina (HDTV: 1920x1080 = 2 million pixels) 49 Frequency Responses of Cones Same amount of energy produces different sensations of brightness at different wavelengths Green wavelength contributes most to the perceived brightness. 50 Trichromatic Color Mixing C Any color can be obtained by mixing three primary colors Red, Green, Blue (RGB) with the right proportion T C , k 1, 2 , 3 k k Tk : Tristimulu s values Image Formation 53 Human Eye vs. Camera Camera components Eye components Lens Lens, cornea Shutter Iris, pupil Film Retina Cable to transfer images Optic nerve to send the incident light information to the brain Human Vision Image formation Pin-Hole Camera Model Point Spread Effect Out-of-Focus Blur Shrinking the Aperture Converging Lens Correction with a Converging Lens Perfectly In-Focus for a Certain Distance Only “circle of confusion” Depth-of-Field Depth-of-Field “Sharp Image” within Depth-ofField due to Finite Sensor Size ZF ZN Focal Length (F) and Depth (Z) Z F Y y Y yF Z xF X Z Aperture Size Affects Depth-Of-Field f / 5.6 f / 32 Aperture Ad 2 Camera f-number F f d F A f 2 Exposure Time Motion Blur Effect due to Finite Exposure Time Decrease in aperture implies… Increase in depth-of-field Decrease in motion blur Decrease in exposure 2D Image Representation 75 Image Capture (Courtesy Gonzalez & Woods) 76 Digital Image Capture Digital Image Capture Light sensitive diodes convert photons to electrons Color Image Capture: Single vs. Three CCD Arrays Bayer filter (cheaper but introduces spatial resolution loss) RGB splitter (three separate imaging sensors, higher resolution) Digital Camera Issues Noise Color charge overflowing into neighboring pixels In-camera processing color fringing (chromatic aberration) artifacts from Bayer patterns Blooming caused by low light over-sharpening can produce halos Compression creates blocking artefacts Digitization: Sampling and Quantization Sampling Rate Problem Over Quantization 83 Images as Matrices of Integers (0,0) m 126 127 126 128 127 124 158 125 126 127 123 120 144 163 123 126 125 121 128 155 160 126 123 127 122 142 162 164 120 122 124 130 157 161 166 119 121 123 145 162 164 165 0 → black, 255 → white n 0 ≤ s(m,n) ≤ 255 } quantization 0 ≤ m ≤ M-1 MxN 8-bit gray-scale (intensity, luminance) image 84 0 ≤ n ≤ N-1 sampling Images as Functions We can think of an image as a function, f, from R2 to R: f( x, y ) gives the intensity at position ( x, y ) Realistically, we expect the image only to be defined over a rectangle, with a finite range: • f: [a,b]x[c,d] [0,1] A color image is just three functions pasted together. We can write this as a “vector-valued” function: r ( x, y ) f ( x, y ) g ( x, y ) b( x, y ) RGB Color Bands (Channels) Red Green Blue YUV Bands Also called Y Cb Cr Y : Luma Cb : Chrominance_blue Cr : Chrominance_red Color Y U (Cb) V (Cr ) YUV-RGB Conversion Summary 89 Summary Human visual system Pin-hole camera model Image representation 90