Introduction of Saliency Map Presenter: Chien-Chi Chen Advisor: Jian-Jiun Ding 1 Outline • Introduction of saliency map • Button-up approach – – – – – – L. Itti’s approach Frequency-tuned Multi-scale contrast Depth of field Spectral Residual approach Global contrast based • Top-down approach – Context-aware • Information maximum – Measuring visual saliency by site entropy rate 2 Outline • Introduction of saliency map • Button-up approach – – – – – – L. Itti’s approach Frequency-tuned Multi-scale contrast Depth of field Spectral Residual approach Global contrast based • Top-down approach – Context-aware • Information maximum – Measuring visual saliency by site entropy rate 3 Introduction of saliency map • Low-level(contrast) – Color Important! – Orientation – Size – Motion – Depth • High-level – People – Context Lowlevel With face detectio n Judd et al, 2009 4 Outline • Introduction of saliency map • Button-up approach – – – – – – L. Itti’s approach Frequency-tuned Multi-scale contrast Depth of field Spectral Residual approach Global contrast based • Top-down approach – Context-aware • Information maximum – Measuring visual saliency by site entropy rate 5 L. Itti’s approach • Architecture: Gaussian Pyramids R,G,B,Y Gabor pyramids for q = {0º, 45º, 90º, 135º} L. Itti’s approach • Center-surround Difference • Achieve center-surround difference through across-scale difference • Operated denoted by Q: Interpolation to finer scale and point-to-point subtraction • One pyramid for each channel: I(s), R(s), G(s), B(s), Y(s) where s [0..8] is the scale L. Itti’s approach • Center-surround Difference – Intensity Feature Maps I(c, s) = | I(c) Q I(s)| c {2, 3, 4} s = c + d where d {3, 4} So I(2, 5) = | I(2) Q I(5)| I(2, 6) = | I(2) Q I(6)| I(3, 6) = | I(3) Q I(6)| … • 6 Feature Maps • • • • L. Itti’s approach • Center-surround Difference Center-surround Difference Orientation Feature Maps •Color Feature Maps • O(c, s,q ) O(c,q ) O(s,q ) Red-Green and Yellow-Blue Same c and s as with intensity +B-Y +R-G +G-R +G-R +R-G +B-Y +Y-B +Y-B +B-Y RG(c, s) = | (R(c) - G(c)) Q (G(s) - R(s)) | BY(c, s) = | (B(c) - Y(c)) Q (Y(s) - B(s)) | L. Itti’s approach • • • Normalization Operator Promotes maps with few strong peaks Surpresses maps with many comparable peaks 1. 2. 3. 4. Normalization of map to range [0…M] Compute average m of all local maxima Find the global maximum M Multiply the map by (M – m)2 L. Itti’s approach Example of Operation: Inhibition of return Frequency-tuned Image Average L I a b S ( x, y) I Ihc ( x, y ) Gaussian blur L hc Ihc ( x, y ) ahc bhc 12 Multi-scale contrast • Saliency algorithm Multi-scale contrast Image Centersurround histogram Conditional Random Field Saliency map Color spatialdistribution 13 Multi-scale contrast Multi-scale contrast • Local summation of laplacian pyramid L fc ( x, I ) || I l ( x) I l ( x) ||2 l 1 xN ( x) Center-surround histogram • Distance between histograms of RGB color: 1 ( Ri Rsi )2 ( R, Rs ) i 2 ( R Rsi ) 2 R* ( x) arg max 2 ( R( x), Rs ( x)) R( x) fh ( x, I ) xx 2 ( R* ( x), Rs* ( x)) {x| xR* ( x)} 14 Multi-scale contrast • Color spatial-distribution The variance of Coordinate of pixel x and y Image(RGB) GMM Distance from pixel x to image center f s ( x, I ) p(c | I x ) (1 V (c)) (1 D(c)) c 15 Multi-scale contrast • Energy term: K E ( A | I ) k Fk (ax , I ) S (ax , ax , I ) x k 1 x, x • Saliency object: f k ( x, I ), ax 0 Fk (ax , I ) 1 f k ( x, I ), ax 1 • Pairwise feature: S (ax , ax , I ) | ax ax | exp( dx, x ) d x, x || I x I x ||, L2norm (2 || I x I x ||2 ) 1 16 Multi-scale contrast • CRF: 1 P( A | I ) exp( E ( A | I )) Z * arg max log P ( An | I n ; ) n • The derivative of the log-likelihood with respect to k 17 Depth of field • As the spread of single lens reflex camera, more and more low depth of field(DOF) images are captured. • However, current saliency detection methods work poorly for the low DOF images. 18 Depth of field • Algorithm: 19 Depth of field • Classification: • Focal Point: In a low DOF image Rectangle with the highest DOG edge density, and center is initial focal point d S (i, j ) S (i, j ) Ae 2s • Composition Analysis: segmentation Sr Sr e Region Ar n d Ais1 ms 2 s 3 20 Spectral Residual Approach • First scaling image to 64x64. • Then we smoothed the saliency map with a gaussian filter g(x) (s = 8). 21 Global contrast-based • Histogram based contrast(Lab): O( N 2 ) O( N ) O( n 2 ) Quantization of Lab Each channel to have 12 different value 85 123 1728 22 Global contrast-based • Region based contrast – Segment the Image – [Efficient graph-based image segmentation] 23 Outline • Introduction of saliency map • Button-up approach – – – – – – L. Itti’s approach Frequency-tuned Center-surround Depth of field Spectral Residual approach Global contrast based • Top-down approach – Context-aware • Information maximum – Measuring visual saliency by site entropy rate 24 Context-Aware • Goal: Convey the image content Liu et al, 2007 25 Context-Aware • Distance between a pair of patches: d ( pi , p j ) dcolor ( pi , p j ) 1 c d position ( pi , p j ) High j salient Context-Aware • Distance between a pair of patches: K 1 r r r Si 1 exp d ( pi , q j ) K k 1 r qk K most similar patches at scale r High for K most similar Saliency Context-Aware • Salient at: – Multiple scales foreground – Few scales background Scale 1 1 Si M Scale 4 rM S r r1 r i Context-Aware • Foci = Si 0.8 • Include distance map 1 d (i) foci Sˆi Si 1 d foci (i ) Si X Outline • Introduction of saliency map • Button-up approach – – – – – – L. Itti’s approach Frequency-tuned Center-surround Depth of field Spectral Residual approach Global contrast based • Top-down approach – Context-aware • Information maximum – Measuring visual saliency by site entropy rate 30 Measuring visual saliency by site entropy rate 1 31 Measuring visual saliency by site entropy rate 2 A fully-connected graph representation is adopted for each 32 Sub-band graph representation 33 Sub-band graph representation 34 Measuring visual saliency by site entropy rate 3 A random walk is adopted on each sub-band graph. And Site entropy rate(SER) is measured the average information from a node to the other 35 The site entropy rate • i Wi ,Wi ij ,W ij 2W j i , j : j i Pij ij ij j • 36 Conclusion • Image processing is funny • Unusual in its neighborhood will correspond to high saliency weight • Contrast is the key of saliency 37 Reference [1] R. Achanta, F. Estrada, P. Wils, and S. S¨usstrunk. Salient region detection and segmentation. In ICVS, pages 66–75. Springer, 2008. 410, 412, 414 [2] R. Achanta, S. Hemami, F. Estrada, and S. S¨usstrunk. Frequency-tuned salient region detection. In CVPR, pages 1597–1604, 2009. 409, 410, 412, 413, 414, 415 [3] L. Itti, C. Koch, and E. Niebur. A model of saliency based visual attention for rapid scene analysis. IEEE TPAMI, 20(11):1254–1259, 1998. 409, 410, 412, 414 [4] X. Hou and L. Zhang. Saliency detection: A spectral residual approach. In CVPR, pages 1–8, 2007. 410, 412, 413, 414 [5] S. Goferman, L. Zelnik-Manor, and A. Tal. Context-aware saliency detection. In CVPR, 2010. 410, 412, 413, 414, 415 [6] MM Cheng, GX Zhang, N. J. Mitra, X. Huang, S.M. Hu. Global Contrast based Salient Region Detect. In CVPR, 2011 . [7] T. Liu, Z. Yuan, J. Sun, J.Wang, N. Zheng, T. X., and S. H.Y. Learning to detect a salient object. IEEE TPAMI, 33(2):353–367, 2011. 410 [8] W. Wang, Y. Wang, Q. Huang, W. Gao, Measuring Visaul Saliency by Site Entropy Rate, In CVPR, 2010. 38