Robust sequential view planning for object recognition using multiple cameras

作者：

Highlights：

•

摘要

While prior relevant research in active object recognition/pose estimation has mostly focused on single-camera systems, we propose two multi-camera solutions to this problem that can enhance object recognition rate, particularly in the presence of occlusion. In the proposed methods, multiple cameras simultaneously acquire images from different view angles of an unknown, randomly occluded object belonging to a set of a priori known objects. By processing the available information within a recursive Bayesian framework at each step, the recognition algorithms attempt to classify the object, if its identity/pose can be determined with a high confidence level. Otherwise, the algorithms would compute the next most informative camera positions for capturing more images. The principle component analysis (PCA) is used to produce a measurement vector based on the acquired images. Occlusions in the images are handled by a novel probabilistic modelling approach that can increase the robustness of the recognition process with respect to structured noise. The camera positions at each recognition step are selected based on two statistical metrics quantifying the quality of the observations, namely the mutual information (MI) and the Cramér-Rao lower bound (CRLB). While the former has also been used in a prior relevant work, the latter is new in the context of object recognition. Extensive Monte Carlo experiments conducted with a two-camera system demonstrate the effectiveness of the proposed approaches.

论文关键词：Active object recognition,Pose estimation,View planning,Occlusion,Sensor fusion,Machine vision,Cramer-Rao lower bound,Mutual information

论文评审过程：Received 12 October 2006, Revised 8 January 2008, Accepted 29 September 2008, Available online 15 October 2008.

论文官网地址：https://doi.org/10.1016/j.imavis.2008.09.009