Some tracking algorithms based on Siamese network have made great progress in similarity learning via features cross-correlation between an object branch and a search branch. However, it is significantly challenging for object tracking in video sequences in terms of target deformation with greatly varying. We propose a Siamese network based on global and local feature matching for object tracking including three phases with the aim of addressing the above issues. In the first phase, obtaining the global similarity matching and local relational mapping similarity of the template branch and the search branch by a selection mechanism of object template-aware features are to reduce the impact of background features on the local matching. In the second phase, introducing correlation matching of the local feature for establishing correspondence among partial-level pixels. Finally, combining the classification and regression results with global matching features and local matching features in a weighted fusion. Extensive experiments are conducted on datasets (OTB-100, LaSOT and GOT-10K) demonstrate that the proposed network enables to achieve superiority compared against the state-of-the-art method and provides an efficient scenario for tackling the issue. |
ACCESS THE FULL ARTICLE
No SPIE Account? Create one
CITATIONS
Cited by 3 scholarly publications.
Detection and tracking algorithms
Video
Deformation
Convolution
Target detection
Feature extraction
Education and training