VFMVAC: View-filtering-based multi-view aggregating convolution for 3D shape recognition and retrieval

作者:

Highlights:

• We propose a high-precision 3D shape multi-view recognition framework which can highly promote the performance of the 3D shape classification and retrieval.

• A voting-based view filtering algorithm is proposed; this algorithm can select the most representative views among the existing views to represent 3D shapes, thereby significantly improving memory usage efficiency and reducing the computational cost.

• A novel multi-view aggregating module is designed; in particular, the k-view features are shuffled using a cross-view channel shuffle module that considers the combination of features across views, thereby allowing for their sufficient fusion; furthermore, this module fuses the multi-view features via an aggregating convolution and considers all features of each view, thereby avoiding information loss induced by the traditional pooling methods.

• The proposed framework achieves state-of-the-art recognition and retrieval performance on benchmark datasets.

摘要

•We propose a high-precision 3D shape multi-view recognition framework which can highly promote the performance of the 3D shape classification and retrieval.•A voting-based view filtering algorithm is proposed; this algorithm can select the most representative views among the existing views to represent 3D shapes, thereby significantly improving memory usage efficiency and reducing the computational cost.•A novel multi-view aggregating module is designed; in particular, the k-view features are shuffled using a cross-view channel shuffle module that considers the combination of features across views, thereby allowing for their sufficient fusion; furthermore, this module fuses the multi-view features via an aggregating convolution and considers all features of each view, thereby avoiding information loss induced by the traditional pooling methods.•The proposed framework achieves state-of-the-art recognition and retrieval performance on benchmark datasets.

论文关键词:Multi-view,Channel shuffle,Convolution,Recognition,Retrieval

论文评审过程:Received 30 August 2021, Revised 5 April 2022, Accepted 1 May 2022, Available online 3 May 2022, Version of Record 9 May 2022.

论文官网地址:https://doi.org/10.1016/j.patcog.2022.108774