Graph neural network for 6D object pose estimation

作者:

Highlights:

摘要

6D object pose estimation plays an important role in various applications such as robot manipulation and virtual reality. In this paper, we introduce a graph convolution neural network based method to addresses the problem of estimating the 6D pose of objects from a single RGB-D image. The proposed method fuses the appearance feature of the RGB image with the geometry feature of point clouds to predict pixel-level pose and the network also predicts pixel-level confidences to prune outlier predictions. The inner structure information of point cloud is learned by a graph convolution neural network. Specially, we adopt a residual graph convolution module to learn a discriminative feature. Our network enables end-to-end training and fast inference. The extensive experiments verify the method and the model achieves state-of-the-art for the LINEMOD and LINEMOD-OCCLUSION dataset (ADD-S: 88.68 and 65.38 respectively).

论文关键词:Pose estimation,Image processing,Deep learning

论文评审过程:Received 23 August 2020, Revised 10 January 2021, Accepted 24 January 2021, Available online 6 February 2021, Version of Record 16 February 2021.

论文官网地址:https://doi.org/10.1016/j.knosys.2021.106839