Combined segmentation, reconstruction, and tracking of multiple targets in multi-view video sequences

作者:

Highlights:

摘要

Tracking of multiple targets in a crowded environment using tracking by detection algorithms has been investigated thoroughly. Although these techniques are quite successful, they suffer from the loss of much detailed information about targets in detection boxes, which is highly desirable in many applications like activity recognition. To address this problem, we propose an approach that tracks superpixels instead of detection boxes in multi-view video sequences. Specifically, we first extract superpixels from detection boxes and then associate them within each detection box, over several views and time steps that lead to a combined segmentation, reconstruction, and tracking of superpixels. We construct a flow graph and incorporate both visual and geometric cues in a global optimization framework to minimize its cost. Hence, we simultaneously achieve segmentation, reconstruction and tracking of targets in video. Experimental results confirm that the proposed approach outperforms state-of-the-art techniques for tracking while achieving comparable results in segmentation.

论文关键词:

论文评审过程:Received 17 May 2016, Revised 7 August 2016, Accepted 15 August 2016, Available online 16 August 2016, Version of Record 6 December 2016.

论文官网地址:https://doi.org/10.1016/j.cviu.2016.08.006