A Bayesian approach to simultaneously recover camera pose and non-rigid shape from monocular images

Authors:

Highlights:

Abstract

In this paper we bring the tools of the Simultaneous Localization and Map Building (SLAM) problem from a rigid to a deformable domain and use them to simultaneously recover the 3D shape of non-rigid surfaces and the sequence of poses of a moving camera. Under the assumption that the surface shape may be represented as a weighted sum of deformation modes, we show that the problem of estimating the modal weights along with the camera poses can be formulated probabilistically as a maximum a posteriori estimate and solved using an iterative least squares optimization. In addition, the probabilistic formulation we propose is very general and allows different constraints to be introduced without adding complexity. As a proof of concept, we show that local inextensibility constraints, which prevent the surface from stretching, can be easily integrated. An extensive evaluation on synthetic and real data demonstrates that our method has several advantages over current non-rigid shape from motion approaches. In particular, we show that our solution is robust to large amounts of noise and outliers, and that it needs neither to track points over the whole sequence nor to be initialized close to the ground truth.
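To make the modal formulation concrete, the following is a minimal sketch of a maximum a posteriori estimate of the modal weights solved by iterative (Gauss-Newton) least squares. It is not the paper's implementation. It assumes a single image with a known camera pose, a pinhole projection with unit focal length, isotropic Gaussian image noise, and a zero-mean Gaussian prior on the weights. It omits the camera pose estimation and the inextensibility constraints the abstract mentions. All function names and parameters (`shape_from_weights`, `project`, `map_weights`, `sigma_n`, `sigma_w`) are hypothetical.

```python
import numpy as np


def shape_from_weights(mean_shape, modes, weights):
    """Surface as mean shape plus a weighted sum of deformation modes.

    mean_shape: (N, 3), modes: (K, N, 3), weights: (K,)
    """
    return mean_shape + np.tensordot(weights, modes, axes=1)


def project(points, R, t):
    """Pinhole projection with unit focal length (assumed camera model)."""
    cam = points @ R.T + t
    return cam[:, :2] / cam[:, 2:3]


def residuals(weights, mean_shape, modes, R, t, obs, sigma_n, sigma_w):
    """Stacked, whitened residuals: reprojection error plus Gaussian prior."""
    pred = project(shape_from_weights(mean_shape, modes, weights), R, t)
    data_term = (pred - obs).ravel() / sigma_n
    prior_term = weights / sigma_w
    return np.concatenate([data_term, prior_term])


def map_weights(mean_shape, modes, R, t, obs,
                sigma_n=0.01, sigma_w=10.0, iters=20, eps=1e-6):
    """Gauss-Newton on the negative log-posterior of the modal weights.

    The Jacobian is approximated by forward differences to keep the
    sketch short; the camera pose (R, t) is assumed known here.
    """
    args = (mean_shape, modes, R, t, obs, sigma_n, sigma_w)
    w = np.zeros(modes.shape[0])  # start from the undeformed mean shape
    for _ in range(iters):
        r = residuals(w, *args)
        J = np.empty((r.size, w.size))
        for k in range(w.size):
            dw = np.zeros_like(w)
            dw[k] = eps
            J[:, k] = (residuals(w + dw, *args) - r) / eps
        # Linearized least squares step: minimize ||J @ step + r||^2
        step = np.linalg.lstsq(J, -r, rcond=None)[0]
        w = w + step
        if np.linalg.norm(step) < 1e-10:
            break
    return w


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Synthetic planar 5x4 grid as the mean shape, with random modes.
    gx, gy = np.meshgrid(np.linspace(-1, 1, 5), np.linspace(-1, 1, 4))
    mean_shape = np.stack([gx.ravel(), gy.ravel(), np.zeros(gx.size)], axis=1)
    modes = 0.2 * rng.standard_normal((3, mean_shape.shape[0], 3))
    true_w = np.array([0.5, -0.3, 0.2])
    R, t = np.eye(3), np.array([0.0, 0.0, 5.0])
    obs = project(shape_from_weights(mean_shape, modes, true_w), R, t)
    print("estimated weights:", map_weights(mean_shape, modes, R, t, obs))
```

Whitening each residual by its standard deviation turns the negative log-posterior into a plain sum of squares, so a new constraint can be added as extra rows of the residual vector without changing the solver. The abstract's claim that constraints can be introduced without extra complexity is consistent with this structure, although the paper's actual formulation may differ.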

Keywords: Deformable surfaces, Pose estimation, Bayesian belief networks, SLAM

Review history: Received 27 January 2014, Accepted 2 May 2016, Available online 8 June 2016, Version of Record 22 June 2016.

Paper URL: https://doi.org/10.1016/j.imavis.2016.05.012