Fast visual tracking by temporal consensus

作者:

Highlights:

摘要

At the heart of every model-based visual tracker lies a pose estimation routine. Recent work has emphasized the use of least-squares techniques which employ all the available data to estimate the pose. Such techniques are, however, susceptible to the sort of spurious measurements produced by visual feature detectors, often resulting in an unrecoverable tracking failure. This paper investigates an alternative approach, where a minimal subset of the data provides the pose estimate, and a robust regression scheme selects the best subset. Bayesian inference in the regression stage combines measurements taken in one frame with predictions from previous frames, eliminating the need to further filter the pose estimates. The resulting tracker performs very well on the difficult task of tracking a human face, even when the face is partially occluded. Since the tracker is tolerant of noisy, computationally cheap feature detectors, frame-rate operation is comfortably achieved on standard hardware.

论文关键词:Visual tracking,Pose estimation,Robust regression,Bayesian inference,Model acquisition

论文评审过程:Received 20 February 1995, Accepted 20 April 1995, Available online 20 February 1999.

论文官网地址:https://doi.org/10.1016/0262-8856(95)01044-0