Enhancing 3D-2D Representations for Convolution Occupancy Networks

作者:

Highlights:

• Proposed a Position-Aware Transformer (PAT) architecture that leverages the Adaptive Multi-Scale Position Encoding (AMSPE) to encode both global and local position information for better point feature representations.

• Proposed a 3D Correlation-Guided Enhancement (CGE) network to alleviate the ambiguous or noisy representations in the 2D feature projection.

• The proposed method outperforms the state-of-the-art methods in both quantitative and qualitative comparisons.

摘要

•Proposed a Position-Aware Transformer (PAT) architecture that leverages the Adaptive Multi-Scale Position Encoding (AMSPE) to encode both global and local position information for better point feature representations.•Proposed a 3D Correlation-Guided Enhancement (CGE) network to alleviate the ambiguous or noisy representations in the 2D feature projection.•The proposed method outperforms the state-of-the-art methods in both quantitative and qualitative comparisons.

论文关键词:Implicit 3D representation,Multi-scale 3D position encoding,3D Correlation-Guided Attentions

论文评审过程:Received 8 February 2022, Revised 16 September 2022, Accepted 4 October 2022, Available online 13 October 2022, Version of Record 19 October 2022.

论文官网地址:https://doi.org/10.1016/j.patcog.2022.109097