91.204.201 Computing IV Chapter Two: Core Module. The Core Functionality Xinwen Fu References Application Development in Visual Studio Reading assignment: Chapter 2 An online OpenCV Quick Guide with nice examples By Dr. Xinwen Fu CS@UML 2 A few things Blackboard submission Report format Sreenshots By Dr. Xinwen Fu CS@UML 3 Outline 2.1 Mat - The Basic Image Container 2.2 How to scan images, lookup tables and time measurement with OpenCV 2.3 Mask operations on matrices 2.4 Adding (blending) two images using OpenCV 2.5 Changing the contrast and brightness of an image 2.6 Basic drawing 2.7 Random generator and text with OpenCV 2.8 Discrete Fourier Transform 2.9 File input and output using XML and YAML 2.10 Interoperability with OpenCV 1 By Dr. Xinwen Fu CS@UML 4 2.1 Mat - The Basic Image Container We have multiple ways to acquire digital images from the real world: digital cameras, scanners, computed tomography or magnetic resonance imaging to just name a few. In every case what we (humans) see are images. When transforming this to our digital devices what we record are numerical values for each of the points of the image. By Dr. Xinwen Fu CS@UML 5 Storing Images A black-white image is nothing more than a matrix containing all the intensity values of the pixel points. How we get and store the pixels values may vary according to what fits best our need In the end all images inside a computer world may be reduced to numerical matrices and some other information describing the matric itself. OpenCV is a computer vision library whose main focus is to process and manipulate these information to find out further ones. The first thing to learn and get accommodated with is how OpenCV stores and handles images. By Dr. Xinwen Fu CS@UML 6 Mat Basically a class with two data parts: the matrix header (containing information such as the size of the matrix, the method used for storing, at which address is the matrix stored and so on) a pointer to the matrix containing the pixel values (may take any dimensionality depending on the method chosen for storing) OpenCV is an image processing library, doing image processing with its functions By Dr. Xinwen Fu CS@UML 7 Copy Mat OpenCV uses a reference counting system. The idea is that each Mat object has its own header. However the matrix may be shared between two instance of them by having their matrix pointer point to the same address. Copy operators will only copy the headers, and as also copy the pointer to the large matrix too, however not the matrix itself. By Dr. Xinwen Fu CS@UML 8 1. Mat A, C; // creates just the header parts 2. // here we'll know the method used (allocate matrix) A=imread(file, CV_LOAD_IMAGE_COLOR); 3. 4. 5. Mat B(A); // use the copy constructor C=A; // assignment operator All the above objects, in the end point to the same single data matrix. Their headers are different, however making any modification using either one of them will affect all the other ones too. In practice the different objects just provide different access method to the same underlying data. Nevertheless, their header parts are different. By Dr. Xinwen Fu CS@UML 9 Refer only to a subsection of the full data The real interesting part comes that you can create headers that refer only to a subsection of the full data. For example, to create a region of interest (ROI) in an image you just create a new header with the new boundaries: Coordinates of the top-left corner By Dr. Xinwen Fu CS@UML Rectangle width and height 10 Cleaning Mat You may ask if the matrix itself may belong to multiple Mat objects who will take responsibility for its cleaning when it’s no longer needed. The short answer is: the last object that used it. For this a reference counting mechanism is used Whenever somebody copies a header of a Mat object a counter is increased for the matrix. Whenever a header is cleaned this counter is decreased. When the counter reaches zero the matrix too is freed. By Dr. Xinwen Fu CS@UML 11 Copy the matrix itself Because, sometimes you will still want to copy the matrix itself too, there exists the clone() or the copyTo() function. Now modifying F or G will not affect the matrix pointed by the Mat header By Dr. Xinwen Fu CS@UML 12 Tips of using Mat Output image allocation for OpenCV functions is automatic (unless specified otherwise). No need to think about memory freeing with OpenCVs C++ interface. The assignment operator and the copy constructor (ctor)copies only the header. Use the clone() or the copyTo() function to copy the underlying matrix of an image. By Dr. Xinwen Fu CS@UML 13 Storing methods for pixel values. You can select color space and data type used The color space refers to how we combine color components in order to code a given color. The simplest one is the gray scale. For colorful ways we have a lot more of methods to choose from. However, every one of them breaks it down to three or four basic components and the combination of this will give all others. The most popular one is RGB, mainly because this is also how our eye builds up colors in our eyes. Its base colors are red, green and blue. To code the transparency of a color sometimes a fourth element: alpha (A) is added. By Dr. Xinwen Fu CS@UML 14 Color Systems RGB is the most common as our eyes use something similar, used by our display systems. The HSV and HLS decompose colors into their hue, saturation and value/luminance components, which is a more natural way for us to describe colors. You may dismiss last component, making your algorithm less sensible to light conditions of the input image. YCrCb is used by the popular JPEG image format. CIE L*a*b* is a perceptually uniform color space, which comes handy if you need to measure the distance of a given color to another color. By Dr. Xinwen Fu CS@UML 15 Data Types for Color Each of building components has their own valid domains. This leads to the data type used. The smallest data type possible is char, which means one byte or 8 bits In case of three components this gives 16 million possible colors to represent (like in case of RGB) This may be unsigned (so can store values from 0 to 255) or signed (values from -127 to +127). Even finer control by using float (4 byte = 32 bit) or double (8 byte = 64 bit) data types for each component. However, increasing the size of a component also increases the size of the whole picture in the memory. By Dr. Xinwen Fu CS@UML 16 Creating explicitly a Mat object Although Mat is a great class as image container it is also a general matrix class. Therefore, it is possible to create and manipulate multidimensional matrices. You can create a Mat object in multiple ways For two dimensional and multichannel images we first define their size: row and column count wise. By Dr. Xinwen Fu CS@UML 17 We need to specify the data type to use for storing the elements and the number of channels per matrix point. To do this we have multiple definitions made according to the following convention: CV_[The number of bits per item][Signed or Unsigned][Type Prefix]C[The channel number] CV_8UC3 means we use unsigned char types that are 8 bit long and each pixel has three items of this to form the three channels. This are predefined for up to four channel numbers. The Scalar is four element short vector. Only 2 dimension matrix can use cout By Dr. Xinwen Fu CS@UML 18 By Dr. Xinwen Fu CS@UML 19 Data Type A primitive OpenCV data type is one of unsigned char, bool, signed char, unsigned short, signed short, int, float, double, or a tuple of values of one of these types, where all the values in the tuple have the same type. Any primitive type from the list can be defined by an identifier in the form CV_<bit-depth>{U|S|F}C(<number_of_channels>), for example: uchar ~ CV_8UC1, 3-element floating-point tuple ~ CV_32FC3 A universal OpenCV structure that is able to store a single instance of such a primitive data type is Vec. Multiple instances of such a type can be stored in a std::vector, Mat, Mat_, SparseMat, SparseMat_, or any other container that is able to store Vec instances. By Dr. Xinwen Fu CS@UML 20 By Dr. Xinwen Fu CS@UML 21 Print for other common items OpenCV offers support for print of other common OpenCV data structures too via the << operator like 2D Point 3D Point std::vector via cv::Mat By Dr. Xinwen Fu CS@UML 22 Print (Cont’d) std::vector of points By Dr. Xinwen Fu CS@UML 23 Outline 2.1 Mat - The Basic Image Container 2.2 How to scan images, lookup tables and time measurement with OpenCV 2.3 Mask operations on matrices 2.4 Adding (blending) two images using OpenCV 2.5 Changing the contrast and brightness of an image 2.6 Basic drawing 2.7 Random generator and text with OpenCV 2.8 Discrete Fourier Transform 2.9 File input and output using XML and YAML 2.10 Interoperability with OpenCV 1 By Dr. Xinwen Fu CS@UML 24 Goal How to go through each and every pixel of an image? How is OpenCV matrix values stored? How to measure the performance of our algorithm? What are lookup tables and why use them? By Dr. Xinwen Fu CS@UML 25 Color space reduction Divide the color space current value with a new input value to end up with fewer colors For instance every value between zero and nine takes the new value zero, every value between ten and nineteen the value ten and so on. By Dr. Xinwen Fu CS@UML 26 How the image matrix is stored in the memory? - gray scale image The size of the matrix depends of the color system used. More accurately, it depends from the number of channels used. By Dr. Xinwen Fu CS@UML 27 How the image matrix is stored in the memory? - RGB color system For multichannel images the columns contain as many sub columns as the number of channels Note that the order of the channels is inverse: BGR instead of RGB By Dr. Xinwen Fu CS@UML 28 Color reduction formula When you divide an uchar (unsigned char - aka values between zero and 255) value with an int value the result will be also char. These values may only be char values. Therefore, any fraction will be rounded down. Taking advantage of this fact the upper operation in the uchar domain may be expressed as: By Dr. Xinwen Fu CS@UML 29 Measure time code runs Another issue is how do we measure time? OpenCV offers two simple functions to achieve this getTickCount() and getTickFrequency(). The first returns the number of ticks of your systems CPU from a certain event (like since you booted your system). The second returns how many times your CPU emits a tick during a second. So to measure in seconds the number of time elapsed between two operations is easy as: By Dr. Xinwen Fu CS@UML 30 Lookup table for color reduction how_to_scan_images imageName.jpg intValueToReduce [G] The final argument is optional. If given the image will be loaded in gray scale format, otherwise the RGB color way is used. The first thing is to calculate the lookup table. By Dr. Xinwen Fu CS@UML 31 The Efficient Way By Dr. Xinwen Fu CS@UML 32 Iterator By Dr. Xinwen Fu CS@UML 33 On-the-fly address calc In case of color images we have three uchar items per column. This may be considered a short vector of uchar items, that has been baptized in OpenCV with the Vec3b name. By Dr. Xinwen Fu CS@UML 34 On-the-fly address calc By Dr. Xinwen Fu CS@UML 35 The Core Function By Dr. Xinwen Fu CS@UML 36 Performance Difference For the best result compile the program and run it on your own speed. For showing off better the differences I’ve used a quite large (2560 X 1600) image. The performance presented here are for color images. For a more accurate value I’ve averaged the value I got from the call of the function for hundred times. By Dr. Xinwen Fu CS@UML 37 Mat_ If you need multiple lookups using this method for an image it may be troublesome and time consuming to enter the type and the at keyword for each of the accesses. To solve this problem OpenCV has a Mat_ data type. It’s the same as Mat with the extra need that at definition you need to specify the data type through what to look at the data matrix, however in return you can use the operator() for fast access of items. To make things even better this is easily convertible from and to the usual Mat data type. A sample usage of this you can see in case of the color images of the upper function. Nevertheless, it’s important to note that the same operation (with the same runtime speed) could have been done with the at() function. It’s just a less to write for the lazy programmer trick. By Dr. Xinwen Fu CS@UML 38 Outline 2.1 Mat - The Basic Image Container 2.2 How to scan images, lookup tables and time measurement with OpenCV 2.3 Mask operations on matrices 2.4 Adding (blending) two images using OpenCV 2.5 Changing the contrast and brightness of an image 2.6 Basic drawing 2.7 Random generator and text with OpenCV 2.8 Discrete Fourier Transform 2.9 File input and output using XML and YAML 2.10 Interoperability with OpenCV 1 By Dr. Xinwen Fu CS@UML 39 Mask operation Recalculate each pixels value in an image according to a mask matrix (also known as kernel). This mask holds values that will adjust how much influence neighboring pixels (and the current pixel) have on the new pixel value. From a mathematical point of view we make a weighted average, with our specified values. By Dr. Xinwen Fu CS@UML 40 Basic Method 1. 2. 3. void Sharpen(const Mat& myImage,Mat& Result) { CV_Assert(myImage.depth() == CV_8U); // accept only uchar images 4. 5. 6. 7. 8. 9. 10. 11. const int nChannels = myImage.channels(); Result.create(myImage.size(),myImage.type()); for(int j = 1 ; j < myImage.rows-1; ++j) { const uchar* previous = myImage.ptr<uchar>(j - 1); const uchar* current = myImage.ptr<uchar>(j ); const uchar* next = myImage.ptr<uchar>(j + 1); uchar* output = Result.ptr<uchar>(j); By Dr. Xinwen Fu CS@UML 41 12. for(int i= nChannels;i < nChannels*(myImage.cols-1); ++i) 13. { *output++ = saturate_cast<uchar>(5*current[i] 14. -current[i-nChannels] - current[i+nChannels] - previous[i] - next[i]); 15. } 16. 17. } 18. Result.row(0).setTo(Scalar(0)); 19. Result.row(Result.rows-1).setTo(Scalar(0)); 20. Result.col(0).setTo(Scalar(0)); 21. Result.col(Result.cols-1).setTo(Scalar(0)); 22. } By Dr. Xinwen Fu CS@UML 42 filter2D function Applying such filters is so common in image processing that in OpenCV there exist a function that will take care of applying the mask (also called a kernel in some places). 1. define a Mat object that holds the mask: 2. Then call the filter2D function specifying the input, the output image and the kernell to use: By Dr. Xinwen Fu CS@UML 43 Outline 2.1 Mat - The Basic Image Container 2.2 How to scan images, lookup tables and time measurement with OpenCV 2.3 Mask operations on matrices 2.4 Adding (blending) two images using OpenCV 2.5 Changing the contrast and brightness of an image 2.6 Basic drawing 2.7 Random generator and text with OpenCV 2.8 Discrete Fourier Transform 2.9 File input and output using XML and YAML 2.10 Interoperability with OpenCV 1 By Dr. Xinwen Fu CS@UML 44 Theory From our previous tutorial, we know already a bit of Pixel operators. An interesting dyadic (two-input) operator is the linear blend operator: By varying from 0 to 1 this operator can be used to perform a temporal crossdissolve between two images or videos, as seen in slide shows and film productions By Dr. Xinwen Fu CS@UML 45 Example 1. 2. 3. 4. 5. 6. 7. 8. #include <opencv/cv.h> #include <opencv/highgui.h> #include <iostream> using namespace cv; int main( int argc, char** argv ) { double alpha = 0.5; double beta; double input; Mat src1, src2, dst; 9. 10. 11. 12. 13. 14. /// Ask the user enter alpha std::cout<<" Simple Linear Blender "<<std::endl; std::cout<<"-----------------------"<<std::endl; std::cout<<"* Enter alpha [0-1]: "; std::cin>>input; By Dr. Xinwen Fu CS@UML 46 /// We use the alpha provided by the user iff it is between 0 and 1 if( alpha >= 0 && alpha <= 1 ) { alpha = input; } 15. Example 16. /// Read image ( same size, same type ) src1 = imread("../../images/LinuxLogo.jpg"); src2 = imread("../../images/WindowsLogo.jpg"); if( !src1.data ) { printf("Error loading src1 \n"); return -1; } if( !src2.data ) { printf("Error loading src2 \n"); return -1; } 17. 18. 19. 20. 21. 25. /// Create Windows namedWindow("Linear Blend", 1); beta = ( 1.0 - alpha ); addWeighted( src1, alpha, src2, beta, 0.0, dst); 26. imshow( "Linear Blend", dst ); 27. waitKey(0); return 0; 22. 23. 24. 28. 29. } By Dr. Xinwen Fu CS@UML 47 Outline 2.1 Mat - The Basic Image Container 2.2 How to scan images, lookup tables and time measurement with OpenCV 2.3 Mask operations on matrices 2.4 Adding (blending) two images using OpenCV 2.5 Changing the contrast and brightness of an image 2.6 Basic drawing 2.7 Random generator and text with OpenCV 2.8 Discrete Fourier Transform 2.9 File input and output using XML and YAML 2.10 Interoperability with OpenCV 1 By Dr. Xinwen Fu CS@UML 48 Image Processing A general image processing operator is a function that takes one or more input images and produces an output image. Image transforms can be seen as: Point operators (pixel transforms) Neighborhood (area-based) operators By Dr. Xinwen Fu CS@UML 49 Pixel Transforms In this kind of image processing transform, each output pixel’s value depends on only the corresponding input pixel value (plus, potentially, some globally collected information or parameters). Examples of such operators include brightness and contrast adjustments as well as color correction and transformations. By Dr. Xinwen Fu CS@UML 50 Brightness and contrast adjustments Two commonly used point processes are multiplication and addition with a constant: The parameters α > 0 and β are often called the gain and bias parameters g(x) = α f(x) + β Sometimes these parameters are said to control contrast and brightness respectively. You can think of f(x) as the source image pixels and g(x) as the output image pixels. Then, more conveniently we can write the expression as: g(i; j) = α f(i; j) + β where i and j indicates that the pixel is located in the ith row and j-th column. By Dr. Xinwen Fu CS@UML 51 Example 3. #include <opencv/cv.h> #include <opencv/highgui.h> #include <iostream> 4. using namespace cv; 5. double alpha; /**< Simple contrast control */ int beta; /**< Simple brightness control */ 1. 2. 6. 7. 8. 9. 10. 11. 12. 13. 14. 15. 16. int main( int argc, char** argv ) { /// Read image given by user Mat image = imread( argv[1] ); Mat new_image = Mat::zeros( image.size(), image.type() ); /// Initialize values std::cout<<" Basic Linear Transforms "<<std::endl; std::cout<<"-------------------------"<<std::endl; std::cout<<"* Enter the alpha value [1.0-3.0]: ";std::cin>>alpha; std::cout<<"* Enter the beta value [0-100]: "; std::cin>>beta; CS@UML By Dr. Xinwen Fu 52 17. 18. 19. 20. 21. 22. 23. 24. 25. 26. 27. 28. 29. 30. 31. 32. 33. 34. /// Do the operation new_image(i,j) = alpha*image(i,j) + beta for( int y = 0; y < image.rows; y++ ) { for( int x = 0; x < image.cols; x++ ) { for( int c = 0; c < 3; c++ ) { new_image.at<Vec3b>(y,x)[c] = saturate_cast<uchar>( alpha*( image.at<Vec3b>(y,x)[c] ) + beta ); } } } /// Create Windows namedWindow("Original Image", 1); namedWindow("New Image", 1); /// Show stuff imshow("Original Image", image); imshow("New Image", new_image); /// Wait until user press some key 36. waitKey(); 37. return 0; CS@UML 38. } 35. By Dr. Xinwen Fu 53 Who is Lena? Lena Söderberg, a Swedish model cropped from the centerfold of November 1972 issue of Playboy magazine By Dr. Xinwen Fu CS@UML 54 Core function Instead of using the for loops to access each pixel, we could have simply used this command: image.convertTo(new_image, -1, alpha, beta); where convertTo would effectively perform new_image = a*image + beta. However, we wanted to show you how to access each pixel. In any case, both methods give the same result. By Dr. Xinwen Fu CS@UML 55 Outline 2.1 Mat - The Basic Image Container 2.2 How to scan images, lookup tables and time measurement with OpenCV 2.3 Mask operations on matrices 2.4 Adding (blending) two images using OpenCV 2.5 Changing the contrast and brightness of an image 2.6 Basic drawing 2.7 Random generator and text with OpenCV 2.8 Discrete Fourier Transform 2.9 File input and output using XML and YAML 2.10 Interoperability with OpenCV 1 By Dr. Xinwen Fu CS@UML 56 Example in Visual Studio 2010 Use Point to define 2D points in an image. Use Scalar and why it is useful Draw a line by using the OpenCV function line Draw an ellipse by using the OpenCV function ellipse Draw a rectangle by using the OpenCV function rectangle Draw a circle by using the OpenCV function circle Draw a filled polygon by using the OpenCV function fillPoly By Dr. Xinwen Fu CS@UML 57 Outline 2.1 Mat - The Basic Image Container 2.2 How to scan images, lookup tables and time measurement with OpenCV 2.3 Mask operations on matrices 2.4 Adding (blending) two images using OpenCV 2.5 Changing the contrast and brightness of an image 2.6 Basic drawing 2.7 Random generator and text with OpenCV 2.8 Discrete Fourier Transform 2.9 File input and output using XML and YAML 2.10 Interoperability with OpenCV 1 By Dr. Xinwen Fu CS@UML 58 Example in Visual Studio 2010 Use the Random Number generator class (RNG) and how to get a random number from a uniform distribution. RNG rng( 0xFFFFFFFF ); rng.uniform(a,b); // This generates a randomly uniformed distribution between the values a and b (inclusive in a, exclusive in b). Display text on an OpenCV window by using the function putText By Dr. Xinwen Fu CS@UML 59 Outline 2.1 Mat - The Basic Image Container 2.2 How to scan images, lookup tables and time measurement with OpenCV 2.3 Mask operations on matrices 2.4 Adding (blending) two images using OpenCV 2.5 Changing the contrast and brightness of an image 2.6 Basic drawing 2.7 Random generator and text with OpenCV 2.8 Discrete Fourier Transform 2.9 File input and output using XML and YAML 2.10 Interoperability with OpenCV 1 By Dr. Xinwen Fu CS@UML 60 Skipped What is a Fourier transform and why use it? How to do it in OpenCV? Usage of functions such as: copyMakeBorder(), merge(), dft(), getOptimalDFTSize(), log() and normalize() . By Dr. Xinwen Fu CS@UML 61 Outline 2.1 Mat - The Basic Image Container 2.2 How to scan images, lookup tables and time measurement with OpenCV 2.3 Mask operations on matrices 2.4 Adding (blending) two images using OpenCV 2.5 Changing the contrast and brightness of an image 2.6 Basic drawing 2.7 Random generator and text with OpenCV 2.8 Discrete Fourier Transform 2.9 File input and output using XML and YAML 2.10 Interoperability with OpenCV 1 By Dr. Xinwen Fu CS@UML 62 Skipped How to print and read text entries to a file and OpenCV using YAML or XML files? How to do the same for OpenCV data structures? How to do this for your data structures? Usage of OpenCV data structures such as FileStorage, FileNode or FileNodeIterator. By Dr. Xinwen Fu CS@UML 63 Outline 2.1 Mat - The Basic Image Container 2.2 How to scan images, lookup tables and time measurement with OpenCV 2.3 Mask operations on matrices 2.4 Adding (blending) two images using OpenCV 2.5 Changing the contrast and brightness of an image 2.6 Basic drawing 2.7 Random generator and text with OpenCV 2.8 Discrete Fourier Transform 2.9 File input and output using XML and YAML 2.10 Interoperability with OpenCV 1 By Dr. Xinwen Fu CS@UML 64 Skipped What changed with the version 2 of OpenCV in the way you use the library compared to its first version How to add some Gaussian noise to an image What are lookup tables and why use them? By Dr. Xinwen Fu CS@UML 65 References OpenCV documentation By Dr. Xinwen Fu CS@UML 66