PICASO: PIxel correspondences and SOft match selection for real-time tracking

作者:

Highlights:

摘要

Visual tracking is one of the computer vision’s longstanding challenges, with many methods as a result. While most state-of-the-art methods trade-off performance for speed, we propose PICASO, an efficient, yet strongly performing tracking scheme. The target object is modeled as a set of pixel-level templates with weak configuration constraints. The pixels of a search window are matched against those of the surrounding context and of the object model. To increase the robustness, we match also from the object to the search window, and the pairs matching in both directions are the correspondences used to localize. This localization process is robust, also against occlusions which are explicitly modeled. Another source of robustness is that the model – as in several other modern trackers –gets constantly updated over time with newly incoming information about the target appearance. Each pixel is described by its local neighborhood. The match of a pixel is taken to be the one with the largest contribution in its sparse decomposition over a set of pixels. For this soft match selection, we analyze both l1 and l2-regularized least squares formulations and the recently proposed l1-constrained ‘Iterative Nearest Neighbors’ approach. We evaluate our tracker on standard videos for rigid and non-rigid object tracking. We obtain excellent performance at 42fps with Matlab on a CPU.

论文关键词:

论文评审过程:Received 10 July 2015, Revised 25 November 2015, Accepted 3 February 2016, Available online 21 November 2016, Version of Record 21 November 2016.

论文官网地址:https://doi.org/10.1016/j.cviu.2016.02.002