Early search termination for fast motion estimation
- Yun-Gu Lee^{1}Email author
https://doi.org/10.1186/s13640-015-0083-4
© Lee. 2015
Received: 13 February 2015
Accepted: 11 August 2015
Published: 2 September 2015
Abstract
This paper proposes a novel method for early termination of a motion search. It first introduces an estimator of the sum of absolute difference (SAD) between the current block and a search point. After analyzing the SAD estimator, it proposes the SAD condition to decide whether the current search point is around a minimum point or not. The counter condition is used to evaluate the current search point as an early search termination cue. After the current search point is considered as a minimum point, the search is stopped immediately. The proposed algorithm can easily be combined with most existing fast motion estimation algorithms to further reduce computational cost. While previous thresholding techniques have focused on the correlation of SAD values between a current block and its neighboring blocks, the proposed algorithm studies its own characteristics to predict the threshold. Hence, the proposed termination scheme can complement the previous early search termination techniques by merging the termination methods together, in order to further reduce computational cost. Experimental results demonstrate that the proposed algorithm successfully reduces the computational cost by combining the previous early search termination techniques.
Keywords
Fast motion estimation Block matching Early search termination Adaptive threshold1 Introduction
Recent video coding standards, such as H.264 [1] and HEVC [2], significantly improve coding efficiency, compared to previous video coding standards. Complexity is also increased along with improvement of the coding efficiency. The video coding standards have adopted the block matching algorithm (BMA) to reduce temporal redundancy between frames. In the BMA, the best matching block of each current block is found in the reference frame. Then, only the difference block between the matching block and the current block is coded with a motion vector (or a displacement). The straightforward method to find the best matching block is a full search block matching algorithm that checks all possible candidates in the search area, which requires a huge computational burden. Moreover, since recent video coding standards basically support variable block size and multiple reference frames, the number of candidates is dramatically increasing. Hence, motion estimation is the most time-consuming part among the video coding tools used in a video encoder. Motion estimation in video coding first finds the integer motion vector within the search area. Then, the subpixel motion vector is found around the integer motion vector. The computational burden for estimating the integer motion vector is much more than that for estimating the subpixel motion vector. So, this paper considers only integer motion estimation.
Fast motion estimation algorithms have been developed to reduce the heavy computational cost of the full search block matching algorithm, such as diamond search [3], MVFAST [4], PMVFAST [5], hexagon-based search [6], and hybrid unsymmetrical cross multihexagon-grid search [7]. In order to find the motion vector, these fast motion estimation algorithms check only limited search points using different shapes and sizes of search patterns, rather than examining all possible search points within a search area. Complex or large search patterns are robust to random motions, but they are expensive. Simple search patterns sometimes fail to find the global minimum point by falling into a local minimum one. Hence, advanced algorithms adaptively select shapes and sizes of search patterns. For example, the large and sparse search pattern is first adopted to prevent from falling into the local minimum point. Then, the small and dense search pattern is used to refine the search.
Early search termination techniques have been developed to further reduce the computational cost of fast motion estimation. Usually, a condition of early search termination is to compare the sum of absolute difference (SAD) value of a current search point with the threshold value. If the SAD value is less than the threshold value, the current search point is considered as the minimum point, and the search is immediately stopped without examining the remaining search points. Ismail mentioned that early stop search termination of a search process could reduce the number of computations [8]. In [4], a constant threshold was used for early search termination. Tourapis proposed an adaptive method in that the threshold value of the current block is determined among the minimum SAD values of the three adjacent blocks [5]. Ismail [9] proposed a dynamic early stop search termination (DESST) by considering a SAD value of an initial search center (ISC) of a current block and the average SAD values of ISCs in all the previous coded blocks. These methods determine the threshold value from SAD values of blocks spatially and temporally adjacent to the current block. Since motion fields of consecutive frames are smooth and gentle [9], they can efficiently judge a block to be stationary or not using the determined threshold. For the stationary blocks, they can terminate early the motion search. However, for the non-stationary blocks, the threshold value predicted from SAD values of blocks spatially and temporally neighboring to the current block is inadequate. Hence, while these methods significantly increase the performance, the determined threshold value may not be the best for every block.
In our previous work, a method to predict the distance between the optimal point (a position of the best matching block) and a current search point was proposed [10]. If the predicted distance between them is less than 0.5 pixel, the current search point is considered as the global minimum point, and the motion search was terminated early. However, since the method considered a one-dimensional image, the predicted distance may not be accurate. Also, there was no analysis of the distance measure in terms of performance and accuracy, so it was very difficult to combine the proposed distance measure with the existing motion estimation algorithms, in order to terminate a search early. Therefore, this paper proposes the new early search termination for fast motion estimation by extending our previous work to a two-dimensional space. It proposes a SAD estimator on the two-dimensional space and studies it to get a condition for early termination of a motion search. This paper also analyzes the proposed algorithm in terms of performance and accuracy.
While conventional early termination methods consider the SAD values of blocks spatially and temporally neighboring to a current block to determine termination conditions, the proposed algorithm takes into account a self-characteristic of the current block for a termination condition. Since the proposed algorithm utilizes different characteristics compared to the existing methods, the proposed termination method and the existing termination methods can complement each other, in order to further improve the speed. This paper provides how the proposed method complements previous thresholding techniques.
This paper is organized as follows. Section 2 describes the proposed algorithm. Experimental results are given in Section 3. Finally, conclusions are presented in Section 4.
2 Proposed method
2.1 SAD estimator
Here, G ^{ H }(i,j) and G ^{ V }(i,j) are (c(i,j)−c(i+1,j)) and (c(i,j)−c(i,j+1)), respectively. Equation (2) can be interpreted as a SAD estimator according to (m, n). If either of m or n is zero, Eq. (2) becomes identical to the equation in the previous work [10], which considers an only one-dimensional case.
2.2 SAD contours
Let (d _{ x }, d _{ y }) be the distance along the x and y axes between the current search point and the optimal motion vector. In the following, this paper assumes that both d _{ x } and d _{ y } are always positive for simplicity. When both d _{ x } and d _{ y } are less than 1 pixel for a given integer search point, (d _{ x }, d _{ y }) is a subpel component of the optimal motion vector. Then, (d _{ x }, d _{ y }) is the same as (m, n), and the SAD value for a given (m, n) can be described by Eq. (2).
2.3 Termination condition
Subpixel motion estimation finds the optimal motion vector with subpixel accuracy around the integer motion vector. Usually, the search range for the subpixel motion estimation is [ −1,1] along the x and y directions. So if (d _{ x }, d _{ y }) for a current integer search point is less than 1 pixel along the x and y directions or the current integer search point is within the rectangle, the current search point should be the integer motion vector. Hence, the integer motion estimation can be stopped immediately, which is the early termination of a motion search. This subsection will study the condition wherein (d _{ x }, d _{ y }) is less than 1 pixel.
Let SAD values of points along contour A in Fig. 2 a be 1000. If the SAD value of a current integer search point is less than 1000, the current search point should be inside of contour A. Otherwise, the current integer search point is outside of contour A. In this way, the relative position of the current integer search point can be roughly predicted by comparing the SAD values of the current integer search point and contours. If a contour is inside of the rectangle and the SAD value of a search point is less than that for the contour, (d _{ x }, d _{ y }) of the search point will be less than 1 pixel. However, although a contour is inside of the rectangle, if a SAD value is more than that for the contour, it is not certain that (d _{ x }, d _{ y }) is less than 1 pixel. For example, let point C be a current integer search point. In this case, the SAD value of the search point is more than that for contour A, and the search point is not inside of the rectangle. On the other hand, if point D is a current integer search point, although the SAD value of the search point is more than that for contour A, the search point is inside of the rectangle. Therefore, only if the SAD value of a search point is less than that for the contour that is inside of the rectangle, (d _{ x }, d _{ y }) for the search point is the inside of the rectangle.
Consequently, if the SAD value of the current search point is less than the threshold, integer motion estimation is terminated. It is noted here that SAD^{(a,b)}(0,1) and SAD^{(a,b)}(1,0) are calculated from only pixels in a current block.
Compared to Fig. 3, the sizes of the gray regions for cases with SAD(0,1)<SAD(1,0) and SAD(0,1)>SAD(1,0) are increased. Hence, the probability of early termination of a motion search will be increased. However, since parts of the gray regions are not included in the rectangular boxes, the search will be sometimes terminated at the improper position, and the quality performance may be degraded. Section 4 will present the performance degradation from wrong termination of a motion search by comparing the performances of Cond _{ MinSad } and Cond _{ MaxSad }.
Here, B _{W} and B _{H} are the width and height of a block to be considered. Cond _{ M i n S a d,S i m } sets a constant threshold value to 2B _{W} B _{H} to support the variable block size mode. Note here that the threshold value for MVFAST [4] was set to 512. The performance of Cond _{ M i n S a d,S i m } will be given in the next section.
2.4 Implementation
To support variable block motion estimation, the threshold value for the termination condition does not need to be calculated at every block size. The computational cost is reduced by sharing the calculated data. First, the method calculates SAD(1,0) and SAD(0,1) of 4×4 blocks in a current block in Fig. 5 a. Then, SAD(1,0) and SAD(0,1) for the given block size are calculated from the above 4×4 blocks (see Fig. 5 b). Finally, the threshold is calculated according to the termination condition, such as Cond _{ MinSad }, Cond _{ MaxSad }, and Cond _{ M i n S a d,S i m }.
Then, the overhead becomes a 0.5 point search.
Some assumptions, such as the pure translational motion, the reconstruction of a pixel using a simple bilinear interpolation, and convex contours of SAD, are not valid in general. Hence, the proposed algorithm may improve speed with a sacrifice of rate (or peak signal-to-noise ratio (PSNR)) performance. Therefore, experimental results in the next section will show the performance of the proposed algorithm in terms of speed and rate.
3 Experimental results
In order to verify the proposed algorithm, two types of experiments are performed. If a search point satisfies the proposed condition while the search point is not very close to the optimal point (or the integer motion vector), the proposed method terminates a motion search at the non-optimal point. Then, there is a chance of degrading the coding performance. Hence, the first experiment examines the accuracy and safety of the proposed termination condition. The second experiment verifies the performance of the proposed algorithm on a video encoder.
Test video sequences
Sequence | Resolution | Frame rate (fps) | Number of frames |
---|---|---|---|
BQTerrace (BQT) | 1920 × 1080 | 60 | 100 |
BasketballDrive (BD) | 1920 × 1080 | 50 | 100 |
Kimono1 (K) | 1920 × 1080 | 24 | 100 |
ChinaSpeed (CS) | 1024 × 768 | 30 | 100 |
ParkScene (P) | 1920 × 1080 | 24 | 100 |
FourPeople (FP) | 1280 × 720 | 60 | 100 |
KristenAndSara (KS) | 1280 × 720 | 60 | 100 |
PeopleOnStreet (PS) | 2560 × 1600 | 30 | 100 |
3.1 Accuracy test
In order to examine the accuracy and safety of the proposed termination condition of Cond _{ MinSad }, SAD values for all possible search points for each block are examined and compared with the condition Cond _{ MinSad }. When among all search points, only search points within an area close to the optimal point satisfy the termination condition, a block is marked as a safe block. Here, the distance between the optimal point (or the integer motion vector) and a search point within the close area should be equal to or less than 1 pixel. If one of the search points outside of the close area satisfies the termination condition, the block is marked as an unsafe block. If the current search point is out of the close area while the point satisfies the proposed termination condition, the search is stopped at the current search point, and the coding performance may be degraded. Hence, the number of safe blocks should be high, while the number of unsafe blocks is low. It should be noted here that the block marked as the unsafe block does not always degrade the coding efficiency. For example, although the block is the unsafe block, if the current search point is within the close area, the search is stopped immediately, and the coding performance will not be degraded. The proposed termination condition Cond _{ MinSad } for the accuracy test is given in Eq. (5).
Accuracy test
Sequence | Method | Safe block | Unsafe block | Total |
---|---|---|---|---|
BQT | Cond _{ MinSad } | 36.48 % | 26.03 % | 62.51 % |
Ismail [9] | 3.30 % | 28.68 % | 31.98 % | |
BD | Cond _{ MinSad } | 15.03 % | 21.78 % | 36.81 % |
Ismail [9] | 6.11 % | 54.69 % | 60.80 % | |
K | Cond _{ MinSad } | 25.84 % | 6.57 % | 32.41 % |
Ismail [9] | 24.94 % | 30.59 % | 55.53 % | |
CS | Cond _{ MinSad } | 38.45 % | 12.85 % | 51.30 % |
Ismail [9] | 17.95 % | 46.07 % | 64.02 % | |
P | Cond _{ MinSad } | 64.36 % | 7.99 % | 72.35 % |
Ismail [9] | 21.50 % | 22.85 % | 44.35 % | |
FP | Cond _{ MinSad } | 48.21 % | 7.28 % | 55.49 % |
Ismail [9] | 48.74 % | 38.72 % | 87.46 % | |
KS | Cond _{ MinSad } | 25.34 % | 8.45 % | 33.79 % |
Ismail [9] | 30.70 % | 54.05 % | 84.75 % | |
PS | Cond _{ MinSad } | 38.17 % | 5.43 % | 43.60 % |
Ismail [9] | 25.36 % | 9.33 % | 34.69 % | |
Avg | Cond _{ MinSad } | 32.49 % | 12.05 % | 48.54 % |
Ismail [9] | 22.33 % | 35.62 % | 57.95 % |
where SAD _{ ISCavg } is the average SAD _{ ISC } value for all the previous coded blocks in the current frame whose best match motion vectors are the initial search centers (ISCs) themselves. SAD _{ ISCcurrent } is the value of SAD at the initial search center for the current block. Here, ε _{1} and ε _{2} are set to 0.75 and 128, respectively, which are the recommended values [9].
In the table, the percentages of the safe block of the proposed algorithm are similar to or higher than those of Ismail [9]. Meanwhile, the percentages of the unsafe block of the proposed algorithm are less. This means that the proposed algorithm may be slightly more accurate than Ismail’s method.
The proposed algorithm can be combined with most existing fast motion estimation algorithms to reduce computational cost. Since the fast motion estimation algorithms adopt different shapes and sizes of search patterns, the overall speed will totally depend on the algorithms. Hence, “Total” in Table 2 is related with only the speed gain by combining the termination method. It represents the percentages of blocks in that at least one search point satisfies the termination condition. There is a chance to terminate early a motion search at those blocks. The higher the percentages of “Total,” the higher the speed gain is. Overall, the total percentage of Ismail’s method is 9.41 % more than that of the proposed method, and Ismail’s method provides the higher speed gain than the proposed algorithm. However, since the proposed algorithm utilizes a different feature to predict the termination condition, the proposed algorithm can be combined with Ismail’s method to further improve the search speed. (The experimental results will be presented in the next subsection.) Hence, the proposed method is still meaningful.
3.2 Performance on a video encoder
3.2.1 Simulation environment
Here, METime _{ CONV } and METime _{ NEW } represent motion estimation times for conventional and newly implemented methods, respectively.
3.2.2 Results
Performance of the proposed algorithm by combining with a conventional motion estimation algorithm
Sequence | QP | Conventional method | Cond _{ MinSad } | Cond _{ MaxSad } | ||||||
---|---|---|---|---|---|---|---|---|---|---|
PSNR | BR | ME time | ΔPSNR | ΔBR | RR | ΔPSNR | ΔBR | RR | ||
(dB) | (Mbps) | (ms) | (dB) | (%) | (dB) | (%) | ||||
BQT | 12 | 50.34 | 380.91 | 20.81 | 0.01 | 0.08 | 0.48 | 0.03 | 0.18 | 0.75 |
18 | 45.12 | 216.47 | 19.95 | 0.00 | 0.23 | 0.47 | 0.01 | 0.50 | 0.74 | |
24 | 39.21 | 90.54 | 18.05 | 0.00 | 0.59 | 0.49 | 0.00 | 1.06 | 0.74 | |
30 | 35.09 | 20.84 | 15.57 | 0.00 | 1.22 | 0.44 | −0.01 | 1.92 | 0.71 | |
BD | 12 | 50.08 | 283.53 | 27.91 | 0.01 | −0.01 | 0.22 | 0.03 | −0.02 | 0.52 |
18 | 44.31 | 138.69 | 27.05 | 0.01 | 0.15 | 0.22 | 0.01 | 0.37 | 0.51 | |
24 | 39.65 | 35.65 | 24.31 | 0.00 | 0.22 | 0.25 | −0.01 | 0.85 | 0.51 | |
30 | 37.49 | 12.01 | 21.96 | 0.00 | 0.15 | 0.14 | 0.00 | 1.21 | 0.43 | |
K | 12 | 49.37 | 109.12 | 29.94 | 0.00 | 0.02 | 0.16 | 0.00 | 0.06 | 0.38 |
18 | 43.94 | 36.06 | 29.51 | −0.01 | 0.12 | 0.17 | −0.01 | 0.28 | 0.39 | |
24 | 41.86 | 10.89 | 27.57 | 0.00 | 0.06 | 0.16 | −0.01 | 0.34 | 0.35 | |
30 | 39.95 | 4.85 | 24.86 | 0.00 | −0.02 | 0.07 | −0.01 | 0.30 | 0.24 | |
CP | 12 | 51.44 | 32.19 | 10.87 | 0.01 | 0.52 | 0.10 | 0.03 | 1.75 | 0.33 |
18 | 47.21 | 18.24 | 10.59 | 0.00 | 0.64 | 0.10 | −0.01 | 2.58 | 0.32 | |
24 | 42.81 | 9.93 | 10.16 | 0.00 | 0.80 | 0.10 | −0.02 | 3.44 | 0.32 | |
30 | 38.26 | 5.11 | 9.39 | −0.01 | 0.79 | 0.09 | −0.03 | 3.88 | 0.31 | |
PS | 12 | 49.54 | 124.18 | 20.78 | 0.00 | 0.02 | 0.47 | 0.00 | 0.06 | 0.63 |
18 | 43.66 | 49.82 | 20.27 | 0.00 | 0.05 | 0.47 | 0.00 | 0.20 | 0.65 | |
24 | 39.87 | 15.23 | 18.72 | 0.00 | 0.12 | 0.46 | 0.00 | 0.60 | 0.64 | |
30 | 36.56 | 5.61 | 18.36 | 0.00 | 0.05 | 0.33 | −0.01 | 0.71 | 0.56 | |
FP | 12 | 49.44 | 76.49 | 6.50 | 0.00 | 0.07 | 0.20 | 0.00 | 0.09 | 0.55 |
18 | 45.01 | 23.02 | 5.40 | 0.00 | 0.07 | 0.22 | 0.00 | 0.25 | 0.57 | |
24 | 42.28 | 4.48 | 3.93 | 0.00 | 0.16 | 0.21 | 0.00 | 0.80 | 0.49 | |
30 | 39.37 | 1.39 | 4.03 | 0.01 | 0.29 | 0.15 | 0.00 | 0.73 | 0.45 | |
KS | 12 | 49.50 | 66.53 | 6.55 | 0.01 | 0.07 | 0.09 | 0.01 | 0.12 | 0.53 |
18 | 45.56 | 19.26 | 5.05 | 0.01 | 0.11 | 0.08 | 0.00 | 0.37 | 0.54 | |
24 | 43.02 | 3.98 | 3.96 | 0.00 | 0.23 | 0.06 | 0.00 | 0.95 | 0.46 | |
30 | 40.11 | 1.27 | 4.04 | 0.01 | 0.24 | 0.04 | 0.00 | 1.43 | 0.37 | |
PS | 12 | 49.96 | 281.31 | 72.57 | 0.00 | 0.06 | 0.22 | 0.01 | 0.29 | 0.49 |
18 | 45.09 | 138.68 | 70.08 | 0.00 | 0.10 | 0.20 | 0.01 | 0.60 | 0.47 | |
24 | 40.91 | 58.89 | 65.52 | 0.00 | 0.12 | 0.17 | 0.00 | 1.03 | 0.47 | |
30 | 37.50 | 25.09 | 60.40 | −0.01 | 0.10 | 0.11 | −0.01 | 1.20 | 0.40 | |
Average | 0.00 | 0.23 | 0.22 | 0.00 | 0.88 | 0.49 |
In Cond _{ MaxSad }, a termination condition of Cond _{ MaxSad } in Eq. (6) is combined with the original Enhanced Predictive Zonal Search. Compared to Fig. 3, the sizes of the gray regions in Fig. 4 are increased. Hence, RR of the Cond _{ MaxSad } is more than that of the Cond _{ MinSad }. However, since parts of the gray regions are outside the rectangular boxes, the search with Cond _{ MaxSad } will be sometimes terminated at the improper position, and ΔBR is increased compared to Cond _{ MinSad } or the conventional method. Table 3 shows the above trade-off between speed and accuracy. Nowadays, prices for storage devices become cheap. Accordingly, in some applications, an increase of 0.88 % in bitrate is not critical while the reduction of computational cost is important. Cond _{ MaxSad } will be beneficial for those applications.
Performance of the proposed algorithm by combining with a simple threshold technique [4]
Sequence | QP | Cond _{ MinSad } | Cond _{ M i n S a d,S i m } | ||||
---|---|---|---|---|---|---|---|
ΔPSNR | ΔBR | RR | ΔPSNR | ΔBR | RR | ||
(dB) | (%) | (dB) | (%) | ||||
BQT | 12 | 0.01 | 0.08 | 0.48 | 0.01 | 0.17 | 0.58 |
18 | 0.00 | 0.23 | 0.47 | 0.00 | 0.37 | 0.58 | |
24 | 0.00 | 0.59 | 0.49 | 0.00 | 0.77 | 0.59 | |
30 | 0.00 | 1.22 | 0.44 | 0.00 | 1.32 | 0.48 | |
BD | 12 | 0.01 | −0.01 | 0.22 | 0.01 | 0.00 | 0.25 |
18 | 0.01 | 0.15 | 0.22 | 0.01 | 0.19 | 0.25 | |
24 | 0.00 | 0.22 | 0.25 | 0.00 | 0.36 | 0.28 | |
30 | 0.00 | 0.15 | 0.14 | 0.00 | 0.29 | 0.15 | |
K | 12 | 0.00 | 0.02 | 0.16 | 0.00 | 0.04 | 0.24 |
18 | −0.01 | 0.12 | 0.17 | −0.01 | 0.16 | 0.25 | |
24 | 0.00 | 0.06 | 0.16 | 0.00 | 0.20 | 0.22 | |
30 | 0.00 | −0.02 | 0.07 | 0.00 | 0.02 | 0.12 | |
CP | 12 | 0.01 | 0.52 | 0.10 | 0.01 | 1.11 | 0.18 |
18 | 0.00 | 0.64 | 0.10 | 0.00 | 1.40 | 0.18 | |
24 | 0.00 | 0.80 | 0.10 | 0.00 | 1.69 | 0.18 | |
30 | −0.01 | 0.79 | 0.09 | −0.01 | 1.42 | 0.16 | |
PS | 12 | 0.00 | 0.02 | 0.47 | 0.00 | 0.02 | 0.50 |
18 | 0.00 | 0.05 | 0.47 | 0.00 | 0.06 | 0.52 | |
24 | 0.00 | 0.12 | 0.46 | 0.00 | 0.16 | 0.50 | |
30 | 0.00 | 0.05 | 0.33 | 0.00 | 0.05 | 0.33 | |
FP | 12 | 0.00 | 0.07 | 0.20 | 0.00 | 0.01 | 0.50 |
18 | 0.00 | 0.07 | 0.22 | 0.00 | 0.06 | 0.52 | |
24 | 0.00 | 0.16 | 0.21 | 0.00 | 0.25 | 0.50 | |
30 | 0.01 | 0.29 | 0.15 | 0.01 | 0.14 | 0.33 | |
KS | 12 | 0.01 | 0.07 | 0.09 | 0.01 | 0.04 | 0.64 |
18 | 0.01 | 0.11 | 0.08 | 0.01 | 0.08 | 0.56 | |
24 | 0.00 | 0.23 | 0.06 | 0.00 | 0.57 | 0.42 | |
30 | 0.01 | 0.24 | 0.04 | 0.01 | 0.53 | 0.30 | |
PS | 12 | 0.00 | 0.06 | 0.22 | 0.00 | 0.11 | 0.30 |
18 | 0.00 | 0.10 | 0.20 | 0.00 | 0.22 | 0.28 | |
24 | 0.00 | 0.12 | 0.17 | 0.00 | 0.31 | 0.24 | |
30 | −0.01 | 0.10 | 0.11 | −0.01 | 0.20 | 0.16 | |
Average | 0.00 | 0.23 | 0.22 | 0.00 | 0.39 | 0.36 |
Performance comparison
Sequence | QP | Ismail[9] | Cond _{ M i n S a d,S i m } | Ismail + Cond _{ M i n S a d,S i m } | ||||||
---|---|---|---|---|---|---|---|---|---|---|
ΔPSNR | ΔBR | RR | ΔPSNR | ΔBR | RR | ΔPSNR | ΔBR | RR | ||
(dB) | (%) | (dB) | (%) | (dB) | (%) | |||||
BQT | 12 | 0.02 | 0.05 | 0.29 | 0.01 | 0.17 | 0.58 | 0.02 | 0.15 | 0.67 |
18 | 0.00 | 0.14 | 0.29 | 0.00 | 0.37 | 0.58 | 0.00 | 0.39 | 0.67 | |
24 | −0.01 | 0.21 | 0.27 | 0.00 | 0.77 | 0.59 | −0.01 | 0.79 | 0.66 | |
30 | 0.00 | 0.11 | 0.21 | 0.00 | 1.32 | 0.48 | −0.01 | 1.32 | 0.59 | |
BD | 12 | 0.02 | −0.11 | 0.28 | 0.01 | 0.00 | 0.25 | 0.03 | −0.07 | 0.39 |
18 | 0.01 | 0.13 | 0.28 | 0.01 | 0.19 | 0.25 | 0.01 | 0.25 | 0.38 | |
24 | −0.01 | 0.03 | 0.28 | 0.00 | 0.36 | 0.28 | −0.01 | 0.36 | 0.38 | |
30 | 0.00 | 0.04 | 0.20 | 0.00 | 0.29 | 0.15 | 0.00 | 0.35 | 0.27 | |
K | 12 | 0.00 | −0.02 | 0.31 | 0.00 | 0.04 | 0.24 | 0.00 | 0.01 | 0.37 |
18 | −0.01 | 0.08 | 0.33 | −0.01 | 0.16 | 0.25 | −0.01 | 0.15 | 0.38 | |
24 | −0.01 | −0.08 | 0.29 | 0.00 | 0.20 | 0.22 | −0.01 | 0.08 | 0.33 | |
30 | 0.00 | −0.02 | 0.22 | 0.00 | 0.02 | 0.12 | 0.00 | 0.06 | 0.26 | |
CS | 12 | 0.01 | 0.43 | 0.12 | 0.01 | 1.11 | 0.18 | 0.02 | 1.32 | 0.25 |
18 | −0.01 | 0.59 | 0.12 | 0.00 | 1.40 | 0.18 | 0.00 | 1.75 | 0.24 | |
24 | −0.01 | 0.41 | 0.13 | 0.00 | 1.69 | 0.18 | −0.01 | 1.82 | 0.24 | |
30 | 0.00 | 0.09 | 0.10 | −0.01 | 1.42 | 0.16 | −0.01 | 1.48 | 0.22 | |
P | 12 | 0.00 | −0.01 | 0.40 | 0.00 | 0.02 | 0.50 | 0.00 | 0.01 | 0.60 |
18 | 0.00 | 0.01 | 0.40 | 0.00 | 0.06 | 0.52 | 0.00 | 0.07 | 0.60 | |
24 | 0.00 | 0.09 | 0.32 | 0.00 | 0.16 | 0.50 | 0.00 | 0.22 | 0.57 | |
30 | 0.00 | 0.08 | 0.24 | 0.00 | 0.05 | 0.33 | 0.00 | 0.20 | 0.48 | |
FP | 12 | 0.00 | 0.00 | 0.59 | 0.00 | 0.01 | 0.50 | 0.00 | 0.01 | 0.67 |
18 | −0.01 | −0.01 | 0.54 | 0.00 | 0.06 | 0.52 | 0.00 | 0.04 | 0.61 | |
24 | 0.00 | 0.16 | 0.39 | 0.00 | 0.25 | 0.50 | 0.00 | 0.32 | 0.48 | |
30 | 0.00 | 0.01 | 0.29 | 0.01 | 0.14 | 0.33 | 0.01 | 0.27 | 0.44 | |
KS | 12 | 0.01 | 0.01 | 0.64 | 0.01 | 0.04 | 0.64 | 0.01 | 0.06 | 0.70 |
18 | 0.00 | 0.03 | 0.56 | 0.01 | 0.08 | 0.56 | 0.00 | 0.12 | 0.62 | |
24 | 0.00 | 0.13 | 0.44 | 0.00 | 0.57 | 0.42 | 0.00 | 0.75 | 0.51 | |
30 | 0.00 | 0.35 | 0.32 | 0.01 | 0.53 | 0.30 | 0.00 | 0.50 | 0.44 | |
PS | 12 | 0.01 | 0.05 | 0.22 | 0.00 | 0.11 | 0.30 | 0.01 | 0.16 | 0.38 |
18 | 0.00 | 0.17 | 0.20 | 0.00 | 0.22 | 0.28 | 0.00 | 0.36 | 0.35 | |
24 | −0.01 | 0.19 | 0.16 | 0.00 | 0.31 | 0.24 | −0.01 | 0.48 | 0.30 | |
30 | −0.01 | 0.10 | 0.11 | −0.01 | 0.20 | 0.16 | −0.01 | 0.31 | 0.22 | |
Average | 0.00 | 0.11 | 0.30 | 0.00 | 0.39 | 0.36 | 0.00 | 0.44 | 0.45 |
Here, B _{W} and B _{H} are the width and height of a block to be considered. As in the table, Cond _{ M i n S a d,S i m } successively reduces the motion estimation time, and the average reduction ratio is 0.36. While the average reduction ratio of Cond _{ M i n S a d,S i m } is better than that of Ismail, the PSNR performance of Cond _{ M i n S a d,S i m } is worse than that of Ismail. However, although the proposed method does not outperform Ismail’s method, the proposed algorithm can be used to complement Ismail’s method, in order to further improve its search speed. While Ismail’s method considers the SAD values of blocks temporally and spatially neighboring to determine the termination condition, the proposed algorithm takes into account the self-characteristic of a current block for calculating the termination condition. Since their termination conditions are exclusive, the merged algorithm further accelerates the search speed. Two termination conditions of Ismail’s and proposed methods are simultaneously considered in Ismail + Cond _{ M i n S a d,S i m }, in order to show how the proposed algorithm complements Ismail’s method. The method chooses the maximal threshold value between Ismail’s and Cond _{ M i n S a d,S i m } to combine two algorithms together. The reduction ratio of Ismail + Cond _{ M i n S a d,S i m } is more than those of Cond _{ M i n S a d,S i m } and Ismail’s method. This simulation results show that the proposed algorithm also complements Ismail’s method. Hence, although the proposed algorithm does not outperform the existing methods, it can be used to accelerate the search speed, and it makes the proposed algorithm meaningful.
Performance comparisons of Cond _{ MaxSad }, Cond _{ MinSad }, Cond _{ M i n S a d,S i m }, Ismail [9], and Ismail + Cond _{ M i n S a d,S i m }
ΔPSNR | ΔBR | RR | |
---|---|---|---|
(dB) | (%) | ||
Cond _{ MaxSad } | 0.00 | 0.88 | 0.49 |
Cond _{ MinSad } | 0.00 | 0.23 | 0.22 |
Cond _{ M i n S a d,S i m } | 0.00 | 0.39 | 0.36 |
Ismail [9] | 0.00 | 0.11 | 0.30 |
Ismail + Cond _{ M i n S a d,S i m } | 0.00 | 0.44 | 0.45 |
4 Conclusions
This paper presents a method to terminate early a motion search for a video encoder. Unlike the previous work, this paper introduces a SAD estimator by considering a two-dimensional space and studies the estimator to get a condition for terminating a motion search early. The proposed algorithm can easily be combined with most of the methods of fast motion estimation to reduce computational cost. While previous thresholding techniques consider SAD values of blocks neighboring a current block, the proposed algorithm takes into account the characteristic of the current block to predict the threshold. Hence, the proposed termination scheme can complement the previous early search termination techniques.
Declarations
Acknowledgements
This present research has been conducted through the Research Grant of Kwangwoon University in 2014. This research was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science and Technology (NRF-2014R1A1A2054105).
Authors’ Affiliations
References
- T Wiegand, GJ Sullivan, G Bjøntegaard, A Luthra, Overview of the H.264/AVC video coding standard. IEEE Trans. Circ. Syst. Video Technol. 13(7), 560–576 (2003).View ArticleGoogle Scholar
- GJ Sullivan, J Ohm, WJ Han, T Wiegand, Overview of the high efficiency video coding (HEVC) standard. IEEE Trans. Circ. Syst. Video Technol. 22(12), 1649–1668 (2012).View ArticleGoogle Scholar
- S Zhu, K-K Ma, A new diamond search algorithm for fast block matching motion estimation. IEEE Trans. Image Process. 9(2), 287–290 (2000).MathSciNetView ArticleGoogle Scholar
- PI Hosur, KK Ma, in Proc. 2nd Int. Conf. Information Communications and Signal Processing (ICICS ’99). Motion vector field adaptive fast motion estimation (Singapore, Dec. 1999), pp. 7–10.Google Scholar
- AM Tourapis, OC Au, ML Liou, Highly efficient predictive zonal algorithms for fast block-matching motion estimation. IEEE Trans. Circ. Syst. Video Technol. 12(10), 934–947 (2002).View ArticleGoogle Scholar
- C Zhu, X Lin, LP Chau, KP Lim, HA Ang, CY Ong, in Proc. ICASSP. A novel hexagon-based search algorithm for fast block motion estimation (Utah, USA, 2001), pp. 1593–1596. 7-11 May 2001.Google Scholar
- ZB Chen, P Zhou, Y He, in JVT-F017, 6th Meeting, Awaji. Fast integer pel and fractional pel motion estimation for JVT (Awaji, Japan, Dec. 2002).Google Scholar
- Y Ismail, M Shaaban, M Bayoumi, in Proc. IEEE ISCAS. An adaptive block size phase correlation motion estimation using adaptive early search termination technique (New Orleans, USA, 2007), pp. 3423–3426. 27-30 May 2007.Google Scholar
- Y Ismail, JB McNeely, M Shaaban, H Mahmoud, MA Bayoumi, Fast motion estimation system using dynamic models for H.264/AVC video coding. IEEE Trans. Circ. Syst. Video Technol. 22(1), 28–42 (2012).View ArticleGoogle Scholar
- Y Lee, JB Ra, Fast motion estimation robust to random motions based on a distance prediction. IEEE Trans. Circ. Syst. Video Technol. 16(7), 869–875 (2006).View ArticleGoogle Scholar
- JM reference software (Aug 1 2014). http://iphome.hhi.de/suehring/tml/.
Copyright
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.