VFMVAC: View-filtering-based multi-view aggregating convolution for 3D shape recognition and retrieval
作者:
Highlights:
• We propose a high-precision 3D shape multi-view recognition framework which can highly promote the performance of the 3D shape classification and retrieval.
• A voting-based view filtering algorithm is proposed; this algorithm can select the most representative views among the existing views to represent 3D shapes, thereby significantly improving memory usage efficiency and reducing the computational cost.
• A novel multi-view aggregating module is designed; in particular, the k-view features are shuffled using a cross-view channel shuffle module that considers the combination of features across views, thereby allowing for their sufficient fusion; furthermore, this module fuses the multi-view features via an aggregating convolution and considers all features of each view, thereby avoiding information loss induced by the traditional pooling methods.
• The proposed framework achieves state-of-the-art recognition and retrieval performance on benchmark datasets.
摘要
•We propose a high-precision 3D shape multi-view recognition framework which can highly promote the performance of the 3D shape classification and retrieval.•A voting-based view filtering algorithm is proposed; this algorithm can select the most representative views among the existing views to represent 3D shapes, thereby significantly improving memory usage efficiency and reducing the computational cost.•A novel multi-view aggregating module is designed; in particular, the k-view features are shuffled using a cross-view channel shuffle module that considers the combination of features across views, thereby allowing for their sufficient fusion; furthermore, this module fuses the multi-view features via an aggregating convolution and considers all features of each view, thereby avoiding information loss induced by the traditional pooling methods.•The proposed framework achieves state-of-the-art recognition and retrieval performance on benchmark datasets.
论文关键词:Multi-view,Channel shuffle,Convolution,Recognition,Retrieval
论文评审过程:Received 30 August 2021, Revised 5 April 2022, Accepted 1 May 2022, Available online 3 May 2022, Version of Record 9 May 2022.
论文官网地址:https://doi.org/10.1016/j.patcog.2022.108774