An Efficient Motion Estimation Scheme with Low Complexity in Video Coding Hyo-sun Yoon Mi-Young Kim(corresponding author) Department of computer science Chonnam national university Gwang-ju, South Korea estheryoon@hotmail.com Department of health and medical Jeonnam Provincial college Damyang, Jeollanamdo, South Korea kimmee@dorip.ac.kr. Abstract—To transmit and to store digital video sequences, compression is necessary. Motion Estimation (ME) Technique is used to redundant data in video sequences. ME which limits the performance of image quality, generated bitrates and encoding time requires huge complexity. To reduce the computational complexity, a Hierarchical motion estimation scheme in Multiview Video Coding (MVC) is proposed. The proposed method exploits the characteristics of the distribution of motion vectors to terminate the motion estimation early and to place the search points in the search area. Experiment results show that the complexity reduction of the proposed method over FS and TZ can be up to 98.9% and 42~78% respectively while maintaining image quality and bitrates. This electronic document is a “live” template and already Keywords—motion estimation; motion vector; multi-view video coding I. INTRODUCTION Digital video is being transmitted over various networks and is stored in various storage devices. To transmit and store digital video, compression is necessary. The international video compression standards exploit motion estimation (ME) technique to remove redundant data in video sequences. ME technique which limits the performance of image quality and the coding speed plays an important role in digital video compression. Recently, there has been a growing interest in 3D TV and free viewpoint video systems. These systems use multi-view video which is obtained by capturing one threedimensional scene with many cameras at different positions. The amount of initial data of multi-view video (MVV) is very huge. To be stored or transmitted to the user, efficient compression technique for MVV is needed. Multi-view video coding (MVC) exploits motion estimation technique to remove temporal redundancy and inter-view redundancy in MVV[1][2]. ME techniques have become one of the most important issues and have attracted much attention. Many motion estimation methods have been proposed. FS which checks every point in search area of the block in reference frame to find best matched point requires huge complexity. Many fast motion estimation methods have been proposed to reduce the computational complexity while maintaining the image quality at the same time. Diamond Search (DS) [3, 4], Hexagon Search (HS) [5], Four Step Search (FSS) [6], Three step search (TSS) [7], New Three Step Search (NTSS) [8] and Predictive Motion Vector Field Adaptive Search Technique (PMVFAST) [9] are well known fast motion estimation methods in H.263 and MPEG 4. UMHexagonS[10] in H.264 is a kind of hierarchical motion search strategy. These search methods [3-10] are generally used for single view video sequence or small size sequences. For multi view video sequence, TZ search algorithm [11] is used. TZ search method performs raster search after initial search. Both raster search and initial search place search points that can cover the overall search area. It implies no need to perform initial search when uiBestDistance is larger than iRaster. It becomes the cause of using the unnecessary computational complexity. To reduce this computational complexity and to maintain the image quality, we proposed an hierarchical motion estimation scheme in multi-view video coding. The proposed search method exploits the characteristics of the distribution of motion vectors to terminate the motion vector search early and to place the search points in the search area of reference frames. This paper is organized as follows. Section 2 describes Pel Block Sesrch (PBS)_and TZ search method in JMVC. The proposed motion estimation scheme is described in Section 3. Section 4 reports simulation results and conclusions are given in Section 5. II. PBS AND TZ Pel Block Search (PBC) and TZ search are the motion estimation algorithms for multi view video in JMVC. Pel Block Search checks all points in search area of reference frames to find the optimal motion vector. However, it requires huge computational complexity. To reduce the complexity of PBS, TZ search is used as a fast motion estimation algorithm. TZ search method uses the search patterns in Fig. 1 and is summarized as follows [12]. Step 1: Motion Vector Prediction - Median predictor, left predictor, up predictor, upper right predictor and (0,0) are used to decide a starting point for next step. Step 2: Initial Grid Search - Run the diamond search with different stride length X in search area[-96, 96] shown in Fig. 1(a)the obtained search center in the previous step. X is 1, 2, 4, 8, 16, 32 or 64. The points shown in Fig. 1.A are tested to This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science and Technology (No. 2010-0024120) c 978-1-4799-8389-6/15/$31.00 2015 IEEE 198 decide the point with the minimum SAD. The point with the minimum SAD is taken as a search center for next search. Calculate the distance between the search center and the point with the minimum SAD. The stride length for this minimum SAD point is stored in variable uiBestDistance. If the uiBestDistance is equal to 0, decide the point with the minimum SAD as the motion vector of the block and terminate the motion estimation search. If uiBestDistance is larger than iRaster, go to Step 3. Otherwise go to 4. To reduce the computational complexity of TZ, a new search scheme for motion estimation is proposed. The proposed hierarchical motion estimation search scheme exploits the fact that about 50% ~98% of the motion vector are within a radius 2 pixels around the search origin (0,0) to terminate the motion vector search early and to place the search points in the search area. Step 3: Raster Search - Raster Search is a simple full search on a down-sampled version of the search window [-96, 96]. Run the raster search with raster length 3(iRaster=3) shown in Fig. 1(b)he points on raster search pattern are tested. The point with the minimum SAD is taken as a search center for next search. Step 4: Star Refinement Search - The search center is moved to the point with the minimum SAD obtained from the previous Step. Repeat the operation of the Step 2. The points in this search pattern are tested and decide the point with the minimum SAD. If the uiBestDistance is equal to 0, decide the point with the minimum SAD as the motion vector of the block and terminate the motion estimation search. Otherwise, repeat this step until uiBestDistance =0. (a)multi-diamond search pattern (b) modified diamond search pattern (c)two-points search pattern (d) modified raster search pattern Figure 2. Proposed search patterns. (a) initial grid search pattern (b) raster search pattern Fig. 1. TZ search patterns. III. PROPOSED HIERARCHICAL SEARCH SCHEME There are some problems in TZ. At first, if the motion of the block is large, raster search is performed after initial gird search. Both raster search and initial grid search place search points that can cover the overall search area [-96, 96]. It becomes the cause of using the unnecessary computational complexity. Secondly, TZ search method used star refinement search which is similar to initial grid search. In star refinement search, there is no need to check the points with stride length 4, 8, 16, 32 or 64. Because there is a strong likelihood that the optimal motion vector exists around the point with the minimum SAD obtained previous step. This refinement search is performed repeatedly until uiBestDistance=0. This process needs unnecessary complexity. Lastly, raster search is performed when the motion of the block is large. Raster search of TZ search configures iRaster as 3 in search area [-96, 96] shown in Fig. 1(b). When the motion of the block is large, iRaster has to be larger than 3. This is because refinement search is performed after raster search. The proposed motion estimation scheme is a hierarchical search strategy. It consists of search patterns in Fig.2. Multi diamond search pattern in Fig. 2.(a) is used as a first search pattern to find the motion size of the current block. Modified diamond search pattern in Fig. 2.(b) and two points search pattern in Fig. 2.(c) are used as a last search pattern respective. If the motion of the current block is large, modified raster search pattern in Fig. 2.(d) is carried out. Modified diamond Search pattern as a refinement search pattern is used to find the best motion vector. The proposed search method is summarized as follows. Step 1: Motion Vector Prediction - Median predictor, left predictor, up predictor, upper right predictor and (0, 0) are used to decide a starting point for next step. Step 2: Multi diamond search - Multi diamond search shown in Fig. 2.(a) at the obtained search center in the previous step is carried out. The points (ķĸĹin Fig. 2.(a)) in multi diamond search pattern are tested and the point with the minimum SAD is decided. Calculate the distance between the search center and the point with the minimum SAD. The distance is stored in uiBestDistance. If the uiBestDistance is equal to 0, decide the search center as the motion vector of the block and terminate the motion estimation search. If the uiBestDistance is equal to 1, go to Step 3. If the uiBestDistance is equal to 2, go to Step 5, Otherwise go to Step 4. 2015 IEEE 10th Conference on Industrial Electronics and Applications (ICIEA) 199 Step 3: two points search - Two neighboring points (ĸ points) of the point with the minimum SAD obtained from the previous Step is carried out. The point with the minimum SAD is decided as the motion vector of the block. And then terminate the motion estimation search. . Step 4: Modified raster search - Run modified raster search with raster length 5(iRaster=6) shown in Fig. 2(d). The points on modified raster search pattern are tested. The point with the minimum SAD is taken as a search center for next search center. Step 5: Modified diamond search - Modified diamond Search at the point with the minimum SAD obtained from the previous Step is carried out. The point with the minimum SAD is decided. Calculate the distance between the search center and the point with the minimum SAD. The distance is stored in uiBestDistance. If the uiBestDistance is equal to 0, decide the search center as the motion vector of the block and terminate the motion estimation search. If the uiBestDistance is equal to 1, go to Step 3. If the uiBestDistance is equal to 2, repeat this step . IV. terminate motion estimation search early. Simulation results show that the encoding time for motion estimation is reduced TABLE I. BDPSNR AND BDBR Sequence kbps Exit Uli Ballroom SIMULATION RESULTS In this section, we show the experiment results for the proposed motion estimation method. The simulation is carried out on JMVC reference software version 6.0. Multi-view test sequences are used for the experiment; Exit, Uli and Ballroom. The frame size of Exit and Ballroom is 640*480 and the frame size of Uli is 1024*768. The total number of frames is 100 in each view. QP is varied from 22 through 37 (22, 27, 32 and 37). The search range is 96. We compared TZ to the proposed method in image quality, bit rates and the total encoding time. Table 1 shows the Bjontegarrd Delta (BD) bitrate and BD-PSNR values taken over the four QPs using the Bjntegaard Delta metric. The results shows that there is almost 42(the motion of block is small)~78(the motion of block is large)% reduction in the total encoding time shown in Table 2, while obtaining negligible change in the PSNR and bitrates shown in Table 1 using BDPSNR and BD-bitrates values. In other words, image quality degradation of the proposed method over TZ is about 0.05(the motion of block is small)~2.0(the motion of block is large) (dB) and bitrates increment is about 1.3(the motion of block is small) ~ 62.35(the motion of block is large) (kbps). We also compared FS to the proposed method in image quality, bit rates and the total encoding time. We used Ballroom and QP is 37. Table 3 shows image quality, bitrates and encoding time. There is almost 98.9% reduction in the total encoding time in Table 3 Race1 Flamenco Sequence dB 39.8 848 39.8 27 337 38.0 370 38.2 32 177 36.4 207 36.3 37 105 34.1 128 34.0 22 4148 38.9 4198 38.9 27 2197 36.9 2232 36.9 32 1215 34.4 1242 34.4 37 667 31.6 677 31.6 22 1494 39.0 1596 39.0 27 751 36.8 828 36.7 32 402 34.2 451 34.2 37 226 31.5 261 31.5 22 1024 39.9 1548 39.6 27 582 37.3 814 36.8 32 298 34.6 475 34.0 37 174 31.9 289 31.2 22 1759 41.2 1808 41.2 27 962 38.4 1103 38.3 32 521 35.3 549 35.3 37 278 32.3 299 32.3 QP Exit 27 32 37 22 Uli 27 32 37 22 27 32 37 22 Race1 27 32 37 22 Flamenco 27 32 200 kbps 794 22 CONCLUSION Pel Block Search (PBC) and TZ search are the motion estimation algorithms To reduce computational complexity and maintain the image quality, a new motion estimation search method for multi-view video coding is proposed in this paper. The proposed search method which is a hierarchical search method exploits the characteristics of the distribution of motion vectors to place the search points in the search area and dB 22 BD PSNR BD BR dB (%) -0.2 13.8 -0.05 1.3 -0.4 12.0 -2.0 62.3 -0.4 9.2 TABLE Ċ. COMPARISION VALUES OF ENCODING TIME Ballroom V. Proposed method TZ QP TZ Proposed method T sec sec (%) 77122 69999 64453 60665 386955 355760 324375 298839 83309 78003 72337 67215 215806 199540 176361 154004 69597 65944 61253 41718 38155 36107 34985 113865 106458 101272 96424 45344 42220 40429 37752 46724 43295 40286 37864 27254 26127 24961 -46 -45 -44 -42 -71 -70 -69 -68 -46 -46 -44 -44 -78 -78 -77 -75 -61 -60 -59 2015 IEEE 10th Conference on Industrial Electronics and Applications (ICIEA) 37 60091 23375 -61 TABLE ċ. BITRATES, PSNR AND ENCODING TIME Full Search Ballroom (QP:37) Proposed Method (kbps) (dB) (sec) (kbps) 0 view 295 32.1 417191 309 32.0 (dB) 4497 (sec) 1 view 156 31.2 434363 195 31.0 4853 2 view 245 32.2 427006 308 32.2 4573 3 view 153 31.2 436203 200 30.9 4874 4 view 268 31.5 428528 319 31.4 4623 5 view 171 31.7 434723 219 31.5 4862 6 view 265 31.8 428527 326 31.8 4660 7 view 244 31.1 430623 297 31.0 4810 by 42~78%, with negligible degradation in PSNR and increase in bitrates ACKNOWLEDGMENT This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science and Technology (No. 2010-0024120). REFERENCES [1] [2] ISO/IEC JTC1/SC29/WG11 N10357, "Vision on 3D Video", Feb. 2009 Draft ITU-T recommendation and final draft international standard of joint video specification, ITU-T Rec. H.264/ISO/IEC 14496-10 AVC,” in Joint Video Team(JVT) of ISO/IEC/ MPEG and ITU-T VCEG, JVTG050, 2010.. [3] J.Y. Tham, S.Ranganath and A.A.Kassim, A Novel Unretricted CenterBiased Diamond Search Algorithm for Block Motion Estimation, IEEE Transactions on Circuits and Systems for Video Technology. Vol. 8 (1988), 369-377. [4] S.Zhu and K.K.Ma .A New Diamond Search Algorithm for Fast Block Matching Motion, IEEE Transaction on Image Processing Vol. 9 No. 2 (2000), 287-290 [5] Ce Zhu, Xion Lin and Lau-Pui Chau, Hexagon based search pattern for fast block motion estimation, IEEE Transactions on Circuits and Systems for Video Technology. Vol. 12 (2002), 349-355. [6] L.M.Po and W.C.Ma. A Novel Four Search Algorithm for Block Motion Estimation. IEEE Trans. on Circuit and Systems for Video Technology Vol. 6 (1996), 313-317. [7] T.Koga, K.Iinuma, A.Hirano, Y.Ishiguro, Motion compensated interframe coding for video conference, Proc. NTC81(1981),G5.3.15.3.5. [8] R.Li, B.Zeng and M.L.Liou, A New Three Step Search Algorithm for Block Motion Estimation, IEEE Trans. on Circuits and System for Video Technology Vol. 4 No. 4 (1994), 438-441. [9] A. M. Tourapis, O. C. Au, M. L. Liou, Fast Block Matching Motion Estimation using Predictive Motion Vector Field Adaptive Search Technique, ISO/IEC/JCTI/SC29/WG11 MPEG2000/M5866 ( 2000) [10] Z. Chen, P. Zhou, Y. He, Fast Integer pel and Fractional pel motion estimation in for JVT in JVT of ISO/ICE/ MPEG and ITU-T VCEG, 6th meeting (2002) [11] JVT of ISO/IEC MPEG, ITU-T VCEG, MVC software Reference Manual-JMVC8.2, (2010). [12] TANG Xiu-lil, OAT Sheng-kui CAT Can-hui, An Analysis of TZSearch Algorithm in JMVC, International Conference Green Circuits and Systems (ICGCS) (2010), 516 – 520 2015 IEEE 10th Conference on Industrial Electronics and Applications (ICIEA) 201