Familiarity based unified visual attention model for fast and robust object recognition

作者:

Highlights:

摘要

Even though visual attention models using bottom-up saliency can speed up object recognition by predicting object locations, in the presence of multiple salient objects, saliency alone cannot discern target objects from the clutter in a scene. Using a metric named familiarity, we propose a top-down method for guiding attention towards target objects, in addition to bottom-up saliency. To demonstrate the effectiveness of familiarity, the unified visual attention model (UVAM) which combines top-down familiarity and bottom-up saliency is applied to SIFT based object recognition. The UVAM is tested on 3600 artificially generated images containing COIL-100 objects with varying amounts of clutter, and on 126 images of real scenes. The recognition times are reduced by 2.7× and 2×, respectively, with no reduction in recognition accuracy, demonstrating the effectiveness and robustness of the familiarity based UVAM.

论文关键词:Visual attention,Object recognition,Scene analysis

论文评审过程:Received 8 January 2009, Revised 10 June 2009, Accepted 30 July 2009, Available online 11 August 2009.

论文官网地址:https://doi.org/10.1016/j.patcog.2009.07.014