 Research
 Open Access
 Published:
Improved CT algorithm based on target block division and feature points matching
EURASIP Journal on Image and Video Processing volume 2018, Article number: 60 (2018)
Abstract
For compressive tracking (CT) algorithm, it is vulnerable to the occlusion, when tracking targets. An improved CT algorithm based on target division and feature point matching is proposed in this paper, which can determine different target tracking states by the method of target division. When the target is in normal tracking or partial occlusion, the target is located accurately by the subblock with the highest discrimination degree. In this scenario, the classifier only updates the unblocked sub regions in order to avoid the error of updating the occlusion information. When the target is completely occluded or lost in some frames, ORB feature matching is used to relocate the target. Experimental results show that our proposed CT algorithm can improve the robustness of the algorithm and reduces the drift problem.
Introduction
Target tracking has a very important application in the field of computer vision (motion analysis, intelligent monitoring, imaging guidance, humancomputer interaction). At present, there are some robust tracking algorithms. For example, super pixel image segmentation method is introduced to track target in the case of occlusion and texture changes [1,2,3]. A tracking algorithm based on multiple stable points is proposed in [4], which have achieved good results in the tracking of nonrigid object. Although these algorithms are highly robust, they are quite timeconsuming, which does not cater to the realtime tracking requirements.
In realtime object tracking, tracking is regarded as online classification, which means using classifiers finds the decision boundaries to separate objects from background. Such algorithms include online learning tracking [5], realtime tracking based on online boosting algorithm [6], online tracking algorithm based on semisupervised [7, 8], and so on. In recent years, compression sensing theory is introduced into target tracking [9, 10]. The compressive tracking (CT) method is proposed in [10,11,12], which compresses the target feature with a sparse measurement matrix and then inputs the compressed feature into the Bayesian classifier. Candidate region with the maximum confidence is selected as target tracking region; thus, the algorithm is adaptive. As this algorithm effectively compresses the target feature, it has better realtime performance. However, there are still some defects in the algorithm. The main problems are concentrated on the following two aspects.

1.
Tracking drift problem
In tracking the moving target, it tends to introduce more background content once the target is blocked. As a result, the classifier learns a lot of obstruction information instead of the real target information, and then, the classifier’s parameters fit more to the obstruction than to the real targets, which makes the tracking position deviate from the target region. In the subsequent tracking process, this offset will gradually accumulate, resulting in tracking failure [13, 14].

2.
Lack of judgment on occlusion
The updating of the classifier parameters of the compressive tracking algorithm is based on the statistical characteristics of the positive and negative samples in the current frame compression domain and the model parameters of the positive and negative samples of the previous frame. When the target is occluded, the extracted compressed feature of the target will undoubtedly lead the samples to be incorrectly classified. In other words, if the classifier is updated on the basis of wrong positive and negative sample, it will result in a decline in the classification performance of the classifier [15]. Therefore, the updating process of the original algorithm is blind and lacks the basic decision mechanism. In addition, there is no corresponding memory mechanism for historical samples. Moreover, once the tracking fails, the algorithm has no retracing ability.
For overcoming the shortcomings of the compressive tracking algorithm, this paper proposes an improved method, which mainly includes two aspects:

1.
The status of the target can be determined by dividing the subregions. When the target is in normal tracking or partially occluded, the classifier only updates the unblocked subregion to avoid the error of the classifier updating occlusion information.

2.
When the target is in complete occlusion or lost, ORiented Brief (ORB) feature matching is used to relocate the target.
The experimental results show that the improved compressive tracking algorithm has higher robustness and realtime performance.
Compressive tracking methods
In [9], feature V ∈ R^{m × 1} is extracted based on compressive process, which is defined as:
where X ∈ R^{n × 1} is the signal translated from target candidate region and R ∈ R^{m × n}(m < < n)is the feature extraction matrix. R can be defined as:
where s is produced randomly from 2 to 4. As the dimension of X is very large, if the Eq. (2) is used directly, the computational cost is very high. Therefore, the target is often randomly divided into some subregions. Feature V is extracted as:
where Rects is the subregion produced randomly from target candidate region, NR is the number of subregions, and r_{i, j} is randomly selected from 1 to − 1. From Eq. (3), we can see that the compression feature is the weighted sum of the original feature using r_{i, j} as the weight.
The tracking process is to find target candidate regions, which has the maximum confidenceH(v). Assuming that all elements in V are independently distributed, H(v) can be defined as
where y ∈ {0, 1} represents sample label, y = 1 represents the positive sample (target), and y = 0 represents the negative sample (background); supposing the prior probabilities of the target and background are equal, that is to say p(y = 1) = p(y = 0) = 1/2, the conditional distributions p(v_{i} y = 1) and p(v_{i} y = 0) are assumed to Gaussian distribution [16]:
where u^{1} and δ^{1} respectively represent the mean and standard deviation of the target samples; u^{0} and δ^{0} are the mean and standard deviation of the background samples, respectively. The target samples are from the region near the target, and the background samples are from the regions far from the target. At the beginning, \( {u}_i^1 \), \( {\delta}_i^1 \), \( {u}_i^0 \), and \( {\delta}_i^0 \) are initialized with the first frame. In the following tracking process, \( {u}_i^1 \), \( {\delta}_i^1 \), \( {u}_i^0 \), and \( {\delta}_i^0 \) are continually updated; the method can be defined as:
where λ means the learning rate, the smaller λ value is, the faster the update rate is, and the less the reserved feature information is.
Although the CT algorithm has the advantages of fast extraction feature and fast detection, there are some shortcomings: in the process of tracking, if the occlusion problem occurs, the classifier can learn a lot of background information and confuse background features with target features, resulting in inaccurate classification and the lost of tracking targets. Aiming at this problem, a compressive tracking algorithm based on target bock division and feature point matching is proposed in this paper.
Improved CT method
On the basis of the original CT algorithm, the occlusion discrimination mechanism and a target relocation strategy are introduced to improve the accuracy and adaptability of the algorithm in this paper.
Occlusion discrimination
In the process of target tracking, other targets and backgrounds often block tracking target. To solve this problem, it is necessary to determine whether the target is obscured or not. So the target state is divided into normal state and occlusion state in this paper. According to the state of the target, different processing mechanisms are used respectively.
In the CT algorithm, the confidence H(v) of all candidate target regions is obtained by Eq. (4), and we select the region with the maximum confidence max(H(v)) as the target region. From Eq. (4), we can know that the value of H(v) is determined by (p(v_{i} y = 1)/p(v_{i} y = 0)).The more features of the target in the candidate region are close to the target, the more likely the candidate region is the target.
H(v) can only indicate which candidate region in the same frame is closer to the target region, but can not judge the deformation or occlusion of the target. We calculated the maximum confidence max(H(v)) of some frames in the car video, and the line diagram of max (H (V)) is shown as shown in Fig. 1.
From Fig. 1, we can see that max(H(v)) value at 127th frame, 136th frame, and 220th frame has been greatly reduced. Figure 2 shows the tracking target at 127th frame, 136th frame, and 220th frames.
It can be seen from Fig. 2 that all of three frames have different degrees of occlusion. When the tracking target is not occluded, max(H(v)) appears a trend of gradual growth and tends to be stable. Therefore, it can be judged by the change of max(H(v)) that the target has larger deformation or occlusion. Let C(k) be the discrimination degree for the occlusion or larger deformation of the kth frame, which can be defined as:
where H^{k − 1} represents the maximum confidence max(H(v)) at the frame k−1 and H^{k} represents the maximum confidence max(H(v)) at the frame k.
Given a threshold ξ, if C(k) > ξ, the target is considered as completely occluded or disappeared; if C(k) < ξ, the target is in the normal tracking state or partial occlusion state.
Occlusion tracking
In general, occlusion starts from the edge of the target. In the process of the occlusion, the feature of the region that is occluded will be lost, but the feature of the region that is not occluded still maintains the original feature information. As long as subregions without occlusion can be tracked, the target can be accurately located through subregions without occlusion. Therefore, a compressive tracking algorithm based on subregions is proposed in this paper. The main idea of the algorithm is to divide the target region into several subregions, the candidate regions of each subregion are created, the candidate region compression features of each subregion are extracted, H(v) is calculated by the Bayesian classifier, and the region with the maximum confidence in terms of the compression feature of candidate subregions is selected to calculate the discrimination degree C(k). If C(k) < ξ, the target region is located according to position of subregions without occlusion. If C(k) > ξ, the target is completely occluded or disappearing.
The division of subregions has a great impact on the tracking effect. Each subregion may be overlapped or nonoverlapping; subregion size can be fixed or adaptive. If the subregion is too large, it will be too sensitive to occlusion, while if the subregion is too small, it may lose target information and there will be too many subregions; thus, the computational cost of the algorithm will be very high. In this paper, the target is divided into four subregions.
As you can see from Fig. 3, if the subregion 1 is occluded, the upper left corner in the target area is occluded. If the subregions 1 and 2 are occluded, the upper half part in the target area is occluded, and so on. We can not only distinguish effectively whether the target is occluded but also determine the location of the target according to the subregions that have not been occluded through method for the subregion.
Suppose D_{i}^{k − 1}i ∈ {1, 2, 3, 4} denotes the i subregion in the frame k − 1, the i candidate subregions of the current frame can be defined as:
where γ denotes the neighborhood radius. Supposing T_{i}(m) denotes the mth candidate region of the subregion i, all candidate region compression features V(k) are computed according to Eq. (3), H(v) is computed according to Eq. (4), and select the candidate region with the maximum confidence to calculate the C(k). If C(k) > ξ, the target is completely blocked or disappearing, and the target is located according to the feature matching method; otherwise, the target is located according to position of the subregion with maximum C(k).
After the target is successfully located, the classifier parameters need to be updated according to the CT algorithm. In order to reduce the impact by occlusion, only the subregions without occlusion update the probability distribution of the features. Therefore, after the target position is determined, the main steps of updating classifier parameter are as follows:
Step 1: Calculating the confidence H_{i}(v) of all the subregions of target region;
Step 2: Calculating the discrimination degree C_{i}(k) of all subregions;
Step 3: Judging by the C_{i}(k) value whether the subregion is occluded or not. If C(k) > ξ, the classifier is not updated. If not, the classifier will be updated. The updating method is as follows:

1.
The positive and negative samples of subregions with C_{i}(k) < ζ are created. The specific method can be defined as:

2.
According to the Eq. (3), the feature V_{i} can be obtained. The calculation of the parameters \( {u}_i^1 \), \( {\delta}_i^1 \), \( {u}_i^0 \), and \( {\delta}_i^0 \) is as follows:
This algorithm model for subregions partition and classifier updating is shown in Fig. 4. As can be seen from Fig. 4, this method can not only judge the target state effectively, but also has good robustness to partial occlusion, local gray change, and deformations.
Target relocate
This method in this paper can deal well with the tracking problem of partial occlusion. But when the target is completely occluded or disappeared, the algorithm cannot track and locate the target accurately. The target detection mechanism in the TLD algorithm is introduced in this paper. A method of target relocation based on improved ORB feature matching is proposed. Firstly, fast corner points are gained, and then, false corner points are removed; finally, the BRIEF descriptor is used to describe the corner points.

(a)
Fast corner
In [17], in the neighborhood of a pixel, there are many pixels that are larger than or smaller than the gray level of that point; the pixel will be the corner point, which is defined as:
where I(p) means the gray value of the candidate pixel, I(x) represents any pixel of the circular boundary with p as the center, and ε_{d} is the threshold. Different threshold ε_{d} can be used to control the number of corner points, and the relationship between the threshold value and the number of corners is shown in Fig. 5. In order to quickly remove the false corner, ε_{d} value is 12 in this paper.

(b)
Removing edge points
Fast corner points include many edge points and local nonmaximum points. The curvature of edge point is larger in the direction perpendicular to the direction of the edge, and smaller along the direction of the edge, while the principal curvatures of the real corner points is larger in any direction [18]. Therefore, the edge points can be removed from fast corner points by principal curvature. In this paper, the principal curvature is calculated by the 2 × 2 Hessian matrix H, which is defined as:
The four elements of H can be obtained by the adjacent difference. According to the property of Hessian matrix, the principal curvature of H is proportional to the eigenvalue of Hessian matrix. Since the principal curvature of the real corner point is larger in any direction [18], if the difference between the two eigenvalues is larger, it shows that the candidate corner point is on the edge; otherwise, the candidate corner is the real corner point. Here, we do not directly calculate two eigenvalues but calculate the ratio of two eigenvalues. Let α be the larger eigenvalue of H matrix, β is its smaller eigenvalue.
In Lowe’s paper [19], α = γβ, ratio = (γ + 1)^{2}/γ, and (γ = 10). If ratio is less than (10 + 1)^{2}/10, the feature points are preserved, otherwise discarded.

(c)
Removing the pseudo corner points
The edge points can be removed via the steps (b). But there are still some local nonmaximum points. It can be further judged by calculating the Laplace value of the pixels in the small neighborhood around the candidate corner point; if the candidate corner point is the Laplace extreme point, then the corner point is preserved and vice versa [20]. The calculation of Laplace extremum is as follows:

(d)
The direction of the fast corner point
Fast corner point does not have direction. In [21], the direction of fast feature points is obtained by gray centroid method. The specific methods are as follows:
Firstly, the moment of the neighborhood of the feature point is computed. The i + j moment is defined as:
And then the centroid is obtained with these moments.
The orientation of the centroid then simply is
where\( {M}_{00}=\sum \limits_x\sum \limits_yI\left(x,y\right) \), \( {M}_{10}=\sum \limits_x\sum \limits_y xI\left(x,y\right) \), and \( {M}_{01}=\sum \limits_x\sum \limits_y yI\left(x,y\right) \).

(e)
BRIEF descriptor
The BRIEF descriptor is a bit string description of an image patch from a set of binary intensity tests; a binary test τ is defined as
wherep(⋅)denotes the function of binary comparisons and (x, y) is a sample pair. Each test sample is a randomly 5 × 5 window of a 31 × 31 pixel patch. The feature is defined as a vector of n binary tests:
As a result, the length of descriptor is n. n = 128, 256, 512…, in this paper, n = 256. The function p(x) is computed as a gray values sum in the5 × 5 window around pixel x. In order to improve the computational speed, the method of integrating graphs is used to compute the sum of the gray value of the image patch.
The BRIEF descriptor is robust to illumination changes, but it is sensitive to noise and rotation. In order to solve the noise sensitive problem, the image is preprocessed by Gauss filter in the ORB algorithm. In order to solve the problem of rotation invariance, for feature set of n, binary tests at location(x, y)define a 2 × n matrix.
The rotation matrix R_{θ} is generated using FAST principal direction angleθ, which is defined as:
Therefore, feature set with orientation at location (x, y) is defined as:
Now, the new feature descriptor becomes:
The BRIEF with orientation has larger variance and a mean near 0.5, which makes its description performance more irrelevant and distinguishable [22].
In the end, 256 high variance and uncorrelated binary strings are selected as the final ORB descriptors by greedy algorithm, and the specific steps are as follows:

1.
The first element in the set T, which is composed of all binary strings, is put into the result set R.

2.
The elements in the set T are in turn compared with the elements in the set R. If the correlation between them is greater than a given threshold value, the binary string is abandoned; otherwise, it will be added to the container R.

3.
Repeat step 2 until there are 256 elements in the result set R. If the number of elements in the result set R is less than 256, then the correlation threshold is increased and the greedy algorithm is performed again, until there are 256 binary strings in the result set R.
Figure 6 shows the matching results based on ORB features. As can be seen from Fig. 6, most of the matching points are focused on the right targets, and only a few of them are wrong.
In order to improve the robustness of the algorithm, the reference point of the target location is decided by the median values of the matched feature points in the horizontal and vertical, which is defined as:
where mid(x_{i})represents the median value of the horizontal coordinates and mid(y_{i})represents the median value of the vertical coordinates.
The reference point is selected as the center point of the target tracking, and the coordinates of the upper left corner of the tracking box are calculated according to Eq. (25), so that the position of the tracking target could be determined.
where rect. x and rect. y, respectively, mean the coordinate of upper left corner of tracking box and width and height are the tracking box’s width and height, respectively. The target detection based on ORB features is shown in Fig. 7:
The partial matching results based on the ORB on car video are shown in Fig. 8.
It can be seen from Fig. 8 that the method based on ORB feature matching can track the target accurately when there are many matching points. Combined with CT algorithm, the method in this paper can effectively improve the accuracy of tracking.
When the target is relocated by ORB feature, the accuracy of position depends on the number of successful matching feature points. When the target appears again, if there is a huge change in shape, the number of successful matching feature points may be less, which results in the difficulty of accurate positioning. Aiming at this problem, a matching template library is constructed in this paper. After the target is lost, all the templates in the template library are used to search the target. If a template in a template library matches a frame image, the number of matching points is greater than the given threshold value, the template is considered as the matching template. The matching template with the largest number of matching points is called the best matching template. Then, the location of the target is determined according to the matching points corresponding to the best matching template. In order to improve the speed of matching, there are not too many templates in the template library; therefore, in this paper, a new method for updating similar templates is proposed. Firstly, the tracking state is determined by tracking module. If the target is in the normal tracking state, the ORB feature of the target area is extracted. Then, the extracted ORB feature is matched with the template in the template library in turn. If the number of matching points is less than the given threshold t, this area is added to the template library, as shown in Fig. 9.
By updating the template library, some templates are stored in the template library, which improves the robustness of the matching. In order to improve the positioning accuracy, the best template matching strategy is proposed in this paper. The specific process is shown as follows:

1.
After the target is lost, all the templates in the template library are used to search the target, respectively.

2.
If a template exists in the template library, the number of matching points with the current image is greater than the given threshold, and then, the template is the matching template;

3.
If there may be multiple matching templates, the template with the largest number of matching points is considered as the best matching template;

4.
The location of the target is determined according to the best matching template, and the relocation is completed.
The template matching process is shown in Fig. 10:
Algorithm flow
The flow chart of the improved CT algorithm based on target division and feature point matching is shown in Fig. 11.
Results and discussion
In order to verify the effectiveness of this proposed algorithm, the algorithm is implemented by VS2010+OpenCV2.4 and ran on the computer of Intel Corei74510U @2.0GHz processor, and 8GB of RAM, using car video for testing, and compared with the original CT algorithm. The testing error in the experiment is given by the below equation:
where (x_{i}, y_{i}) and (x_{ti}, y_{ti}) represent the central point coordinates of the predicted target and the real target, respectively.
Comparison and analysis of partial occlusion
Figure 12 shows the partial tracking results of the two algorithms in the first 60 frames of the car video. The blue rectangle indicates the tracking result of CT algorithm, and the red rectangle is the tracking result of the improved algorithm. As can be seen from Fig. 12, since the 30th frame, the target is gradually getting away from sight of the camera. The tracking window of the original CT algorithm is constantly shifting to the right, while the target can be more accurately tracked using our improved method. Because the improved method is based on subregions, relied on the information of the nonobscured subregions, the position of the target can be predicted. After the 50th frame, the target is shown wholly in the video; the tracking window of the original CT algorithm is shifting to right for a large number of background information being counted into the update of parameters, while the target can be precisely tracked again using the improved algorithm. Figure 13 plots the tracking error in terms of precisely the first 60 frames. It can be seen from Fig. 13 that there is no larger occlusion in the first 20 frames of the car video, and the tracking error of the original CT algorithm and the improved algorithm in this paper is small. The target is gradually getting out of the camera scene from 30th frames to 40th frames, and the tracking error of the original CT algorithm is increasing, but our proposed algorithm can provide constantly good performance, due to the improved algorithm can determine the location of the obscured target more accurately based on the nonobscured subregions of the target. The target gradually reappears from 50th frames, and the tracking error of the original CT algorithm is still high; this is because the original CT algorithm has learned a lot of background information, which makes the predicted tracking position deviate from the target region, while the result of our improved algorithm is relatively stable. Therefore, the algorithm in this paper is more accurate and robust when the target is partially occluded.
Comparison and analysis of complete occlusion
Figure 14 shows the partial tracking results of the two algorithms in 560–600 frames of the car video. The blue rectangle box represents the tracking result of the CT algorithm. The red rectangle box is the tracking result of the improved algorithm. It can be seen from Fig. 14, from the 560th to the 565th frames, that the tracking box of the improved algorithm disappears, and the tracking results of the original CT algorithm shows a large offset. This is because the determination method of target disappearance is introduced in this paper. The original CT algorithm cannot judge the disappearance of the target, so when the target is completely blocked or completely off the scope of the camera, the CT algorithm updates a lot of background information; thus, background is wrongly used as a target. At the 570th frame, the tracking box of the proposed algorithm appears again, and the location of the target is quite accurately relocated; in contrast, the tracking result of the original CT algorithm deviates completely from the target. The performance advantage of our proposed algorithm should thank to the relocation method based on ORB feature matching.
Figure 15 shows the tracking error diagram from 560th frame to 600th frame. As the discriminating mechanism in the target occlusion or disappearance situations introduced in our improved algorithm, the tracking error from 560th frame to 568th frame cannot be calculated, so it is set to null value. We can tell from Fig. 15 that the proposed algorithm can relocate the target after the target is lost and the tracking error can be kept at a relatively stable and low level after the relocation, while in the CT algorithm, because the classifier learns the information of the obstruction, the obstruction might be regarded as the target by mistake, and then, the tracking error becomes larger and larger.
Table 1 shows the average error of the tracking results of the two algorithms in the car video. As can be seen from the table, this improved algorithm is better than the CT algorithm; the tracking accuracy is higher.
From the above two experiments, it can be seen that the original CT algorithm is potential to update a lot of background information because it does not judge the occlusion when facing the occlusion problem, which makes the background mistakenly considered as the target and makes the error larger; the improved algorithm can judge the target’s state effectively by dividing subregions. When the target is in normal tracking and partial occlusion, the target is located accurately using the subregions with the highest discriminate degree. The classifier only updates the unblocked subregion, which avoids updating background information. When the target is completely occluded or lost, the target can be located by the method of the ORB feature matching, which makes up with the deficiency of the original CT algorithm and improves the tracking accuracy.
Comparison and analysis of time efficiency
Table 2 lists the calculation speed of the two algorithms. As can be seen from Table 2, the proposed algorithm has not doubled the time efficiency compared with the original CT in the case of an increase of four times the number of candidate targets at most. This is because the most timeconsuming part of the calculation is the feature extraction; this part of the calculation is done by the integral graph in one time. At the same time, when facing partial occlusion, the classifier is only updated for the unblocked subregion, which also reduces the processing time.
Comparison between the algorithm in this paper and TLD algorithm
TLD algorithm is the most popular tracking algorithm, which combines tracking algorithm and recognition algorithm, and introduces the tracking failure identification mechanism. Inspired by the design of TLD algorithm, the algorithm in this paper introduces the target detection mechanism on the basis of original CT algorithm. Figure 16 shows the tracking effect of the TLD algorithm and the algorithm in this paper on the same frame of the car video.
It can be seen from Fig. 16 that the proposed algorithm has better stability. This is because our algorithm of this paper mainly depends on the CT algorithm. The CT algorithm has the advantages of classifier updating and learning, which is more accurate and robust than the LK optical flow in the TLD algorithm. In the TLD algorithm, if there is difference between the target and template in the TLD template library, it will be difficult for TLD algorithm to detect the target.
Table 3 lists the test results of the two algorithms. Table 3 indicates that the algorithm proposed in this paper shows better performance compared to the TLD algorithm, and false rate and missing rate are both lower.
Conclusions
Aiming at the problem of poor ability of antiocclusion in compressive tracking (CT), an improved CT algorithm based on target segmentation and feature point matching is proposed. The problem of target tracking in occlusion is solved by target segmentation. The target is captured again by local feature matching after the disappearance of the target. The experimental results show that the improved algorithm has strong robustness. However, the method still has some shortcomings. For example, when the target reappears, the scale of the target may change and the scale of the target tracking box needs to be solved after relocation.
Abbreviations
 CRF:

Corner response function
 CT:

Compressed tracking
 ORB:

ORiented Brief
 TLD:

Tracking learning detection
References
S Wang, H Lu, F Yang, et al., in Proceedings of the IEEE Conference on Computer Vision: November 6–13, 2011. Superpixel tracking (IEEE, Barcelona, 2014), pp. 1323–1330
A Bugeau, P Perez, Track and cut: simultaneous tracking and segmentation of multiple objects with graph cuts. EURASIP Journal on Image and Video Processing. 2, 447–454 (2008)
S Oron, A BarHillel, D Levi, S Avidan, Locally orderless tracking. Int. J. Comput. Vis. 111, 213–228 (2015)
J Kwon, KM Lee, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition: June 20–25, 2009. Tracking of a nonrigid object via patchbased dynamic appearance modeling and adaptive Basin Hopping Monte Carlo sampling (IEEE, Miami, FL, 2009), pp. 1208–1215
Z Kalal, J Matas, K Mikolajczyk, in Proceedings of the IEEE Conference on Computer Vision Workshops: Sept. 27 2009–Oct. 4 2009. Online learning of robust object detectors during unstable tracking (IEEE, Kyoto, 2009), pp. 1417–1424
G Helmut, in Proceedings of the British Machine Vision Conference: September 47, 2006. Realtime tracking via online boosting (IEEE, Edinburgh, 2006), pp. 47–56
A Adam, E Rivlin, I Shimshoni, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition: June 17–22, 2006. Robust Fragmentsbased Tracking using the Integral Histogram (IEEE, New York, 2006), pp. 798–805
SMS Nejhum, J Ho, MH Yang, Visual tracking with histograms and articulating blocks. Computer Vision & Image Understanding 114, 901–914 (2008)
HX Li, CH Shen, QF Shi, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition: June 20–25,2011. Realtime visual tracking using compressive sensing (IEEE, Colorado Springs, 2011), pp. 1305–1312
K Zhang, L Zhang, MH Yang, in Proceedings of the European Conference on Computer Vision: October 7–13, 2012. Realtime compressive tracking (Springer, Florence, 2012), pp. 866–879
PM Fonseca, J Nesvadba, Face tracking in the compressed domain. EURASIP Journal on Advances in Signal Processing. 1, 1–11 (2006)
K Zhang, L Zhang, MH Yang, Fast compressive tracking. IEEE Trans. Pattern Anal. Mach. Intell. 36(10), 2002–2015 (2014)
QP Zhu, J Yan, DX Deng, Compressive tracking via oversaturated subregion classifiers. IET Comput. Vis. 17(6), 448–455 (2013)
L ZHANG, J HAN, B LI, et al., The scale adaptive feature compressed tracking. J Natl Univ Defense Technol 35, 146–151 (2013)
PA Deotale, VJ Preetida, Object detection and localization using compressed sensing. Advances in Signal Processing and Intelligent Recognition Systems 678, 127–141 (2018)
DL Donoho, Compressed sensing. IEEE Trans. Inf. Theory 52, 1289–1306 (2006)
E Rosten, T Drummond, in Proceedings of the European Conference on Computer Vision: May 7–13, 2006. Machine learning for highspeed corner detection (Springer, Graz, 2006), pp. 430–443
MS Verkeenko, Development of an algorithm for fast corner points detection. Journal of Computer and Systems Sciences International 53(3), 392–401 (2014)
DG Lowe, Distinctive image features from scaleinvariant keypoints. Int. J. Comput. Vis. 60, 91–110 (2004)
TTH Tran, E Marchand, in Proceedings of the IEEE Conference on Robotics and Automation: April 10–14, 2007. Realtime keypoints matching: application to visual servoing (IEEE, Roma, 2007), pp. 3787–3792
E Rublee, V Rabaud, K Konolige, et al., in Proceedings of the IEEE Conference on Computer Vision: Nov 6–13, 2011. ORB: an efficient alternative to SIFT or SURF (IEEE, Barcelona, 2012), pp. 2564–2571
W Huang, LD Wu, HC Song, et al., RBRIEF: a robust descriptor based on random binary comparisons. IET Computer Vision 7(1), 29–35 (2013)
Acknowledgements
The authors thank the editor and anonymous reviewers for their helpful comments and valuable suggestions.
Funding
This work was supported in part by a grant from the National Natural Science Foundation of China (No. 61403060, 61603146), a grant from Sixtalent peaks project in Jiangsu Province, China (No. XYDXXJS012), Research on Natural Science of Colleges and Universities in Jiangsu(No.15KJA460003), Modern Educational Technology of Jiangsu (No.2016R49007), Topnotch Academic Programs Project of Jiangsu Higher Education Institutions,Science and Technology Project in Huai'an City(No.HAG201602), and the Key Laboratory of Intelligent City and Virtual Reality in Huai'an City(No.HAP201605).
Availability of data and materials
We can provide the data.
Author information
Affiliations
Contributions
All authors take part in the discussion of the work described in this paper. The author WW wrote the first version of the paper, the author WZ did part experiments of the paper, and MJ, YY, and XC revised the paper in different version of the paper, respectively. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Authors’ information
Wenhao Wang received his B.S. degree in Mechatronics from Nanjing University of Science and Technology, Nanjing, China, in 2000, and the M.S. degree in software engineering from Nanjing University of Science and Technology, Nanjing, China, in 2004. He is currently an associate professor in the Faculty of Computer & software Engineering at Huaiyin Institute of Technology. His research interests include digital image processing, pattern recognition, and algorithm optimization. Contact: jswwh407@sina.com
Mingxin Jiang received her B.S. degree in Measurement & Control Technology and Instrument and the M.S. degree in Communications and Information System from Jilin University, Changchun, China, in 2002 and 2005. She received a Ph.D. degree in Signal and information processing, Dalian University of Technology, China, in 2013. She was a postdoctoral researcher with the Department of Electrical Engineering in Dalian University of Technology from 2013 to 2015. She is currently an associate professor in Faculty of Electronic information Engineering at Huaiyin Institute of Technology. Her research interests include multiobject tracking, video content analysis, and vision sensors for robotics. Contact: jiangmingxin@126.com
Yunyang Yan received his B.S. degree in Computer Science and Technology from Nanjing Institute of Aeronautics, Nanjing, China, in 1988, the M.S. degree in Computer Science and Technology from Southeast University, Nanjing, China, in 2002, and the Ph.D. degree in Pattern Recognition and Intelligent System from Nanjing University of Science and Technology, Nanjing, China, in 2008. He is currently a professor in Faculty of Computer & Software Engineering at Huaiyin Institute of Technology. His research interests include digital image processing, pattern recognition, and management information system. Contact: areyyyke@163.com
Xiaobing Chen received his B.S. degree in Mechanical Manufacture & Automation, the M.S. degree in Computer Graphics, and the Ph.D. degree in Aeronautical & Astronautic Manufacturing, from Nanjing University of Aeronautics & Astronautics, China, in 1992, 2003, and 2012, respectively. He is currently a professor in Faculty of Computer and Software Engineering at Huaiyin Institute of Technology. His research interests include computer graphics, 3D modeling, and 3D printing. Contact: hychen_xb@sohu.com
Wendong Zhao received his B.S. degree in Electronic Engineering and the M.S. degree in Computer Technology from Jiansu University, Zhenjian, China, in 1996 and 2007, respectively. He is currently an associate professor in Faculty of Computer and Software Engineering at Huaiyin Institute of Technology. His research interests include graphics and image processing, video content analysis, and Internet of Things engineering. Contact: zwd_139@163.com
Ethics approval and consent to participate
Approved.
Consent for publication
Approved.
Competing interests
The authors declare that they have no competing interests, and all authors have seen the manuscript and approved to submit to your journal. We confirm that the content of the manuscript has not been published or submitted for publication elsewhere.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Wang, W., Jiang, M., Yan, Y. et al. Improved CT algorithm based on target block division and feature points matching. J Image Video Proc. 2018, 60 (2018). https://doi.org/10.1186/s136400180302x
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s136400180302x
Keywords
 Target tracking
 Occlusion
 Subregion
 Feature points matching