A Task-driven Network for Mesh Classification and Semantic Part Segmentation (2306.05246v3)
Abstract: With the rapid development of geometric deep learning techniques, many mesh-based convolutional operators have been proposed to bridge irregular mesh structures and popular backbone networks. In this paper, we show that while convolutions are helpful, a simple architecture based exclusively on multi-layer perceptrons (MLPs) is competent enough to deal with mesh classification and semantic segmentation. Our new network architecture, named Mesh-MLP, takes mesh vertices equipped with the heat kernel signature (HKS) and dihedral angles as the input, replaces the convolution module of a ResNet with Multi-layer Perceptron (MLP), and utilizes layer normalization (LN) to perform the normalization of the layers. The all-MLP architecture operates in an end-to-end fashion and does not include a pooling module. Extensive experimental results on the mesh classification/segmentation tasks validate the effectiveness of the all-MLP architecture.
- Scape: shape completion and animation of people. ACM Trans. Graph. 24 (2005), 408–416.
- Geometric deep learning: Grids, groups, graphs, geodesics, and gauges. ArXiv abs/2104.13478 (2021).
- Layer normalization. ArXiv abs/1607.06450 (2016).
- Learning class-specific descriptors for deformable shapes using localized spectral convolutional networks. Computer Graphics Forum 34 (2015).
- Faust: Dataset and evaluation for 3d mesh registration. 2014 IEEE Conference on Computer Vision and Pattern Recognition (2014), 3794–3801.
- An image is worth 16x16 words: Transformers for image recognition at scale. ArXiv abs/2010.11929 (2021).
- Deep geometric functional maps: Robust feature learning for shape correspondence. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2020), pp. 8592–8601.
- Laplacian2mesh: Laplacian-based mesh understanding. IEEE Transactions on Visualization and Computer Graphics (2023), 1–13.
- Gwcnn: A metric alignment layer for deep shape analysis. Computer Graphics Forum 36 (2017).
- Meshnet: Mesh neural network for 3d shape representation. In Proceedings of the AAAI Conference on Artificial Intelligence (2019), vol. 33, pp. 8279–8286.
- Shape retrieval contest 2007: Watertight models track. SHREC competition 8, 7 (2007), 7.
- Pct: Point cloud transformer. Comput. Vis. Media 7 (2021), 187–199.
- 3d-coded: 3d correspondences by deep deformation. In Proceedings of the european conference on computer vision (ECCV) (2018), pp. 230–246.
- Meshcnn: a network with an edge. ACM Transactions on Graphics (TOG) 38 (2019), 1 – 12.
- Curvanet: Geometric deep learning based on directional curvature for 3d shape analysis. Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (2020).
- Subdivision-based mesh convolution networks. ACM Trans. Graph. 41, 3 (mar 2022).
- Deep residual learning for image recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016), 770–778.
- Ioffe S.: Batch renormalization: Towards reducing minibatch dependence in batch-normalized models. In NIPS (2017).
- Klokov R., Lempitsky V. S.: Escape from cells: Deep kd-networks for the recognition of 3d point cloud models. 2017 IEEE International Conference on Computer Vision (ICCV) (2017), 863–872.
- A multi-view recurrent neural network for 3d mesh segmentation. Comput. Graph. 66 (2017), 103–112.
- Pointcnn: Convolution on x-transformed points. In NeurIPS (2018).
- So-net: Self-organizing network for point cloud analysis. In Proceedings of the IEEE conference on computer vision and pattern recognition (2018), pp. 9397–9406.
- Shrec ’11 track: Shape retrieval on non-rigid 3d watertight meshes. In 3DOR@Eurographics (2011).
- Loop C. T.: Smooth subdivision surfaces based on triangles. In Masters Thesis, Department of Mathematics, University of Utah (January 1987) (1987).
- Deep functional maps: Structured prediction for dense shape correspondence. In Proceedings of the IEEE international conference on computer vision (2017), pp. 5659–5667.
- Lahav A., Tal A.: Meshwalker: Deep mesh understanding by random walks. ACM Trans. Graph. 39 (2020), 263:1–263:13.
- Convolutional neural networks on surfaces via seamless toric covers. ACM Transactions on Graphics (TOG) 36 (2017), 1 – 10.
- Field convolutions for surface cnns. 2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2021), 9981–9991.
- Primal-dual mesh convolutional neural networks. ArXiv abs/2010.12455 (2020).
- Zoomout: spectral upsampling for efficient shape correspondence. ACM Transactions on Graphics (TOG) 38, 6 (2019), 1–14.
- Vnect: Real-time 3d human pose estimation with a single rgb camera. ACM Trans. Graph. 36 (2017), 44:1–44:14.
- Deep learning-based semantic vessel graph extraction for intracranial aneurysm rupture risk management. International Journal of Computer Assisted Radiology and Surgery 18 (2023), 517 – 525.
- Functional maps: a flexible representation of maps between shapes. ACM Transactions on Graphics (ToG) 31, 4 (2012), 1–11.
- Poulenard A., Ovsjanikov M.: Multi-directional geodesic neural networks via equivariant convolution. ACM Transactions on Graphics (TOG) 37, 6 (2018), 1–14.
- Pointnet: Deep learning on point sets for 3d classification and segmentation. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017), 77–85.
- Pointnet++: Deep hierarchical feature learning on point sets in a metric space. In NIPS (2017).
- You only look once: Unified, real-time object detection. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016), 779–788.
- U-net: Convolutional networks for biomedical image segmentation. In MICCAI (2015).
- Part-based mesh segmentation: a survey. In Computer Graphics Forum, vol. 37, Wiley Online Library, pp. 235–274.
- Continuous and orientation-preserving correspondences via functional maps. ACM Transactions on Graphics (TOG) 37 (2018), 1 – 16.
- Unsupervised deep learning for structured shape matching. In Proceedings of the IEEE/CVF International Conference on Computer Vision (2019), pp. 1617–1627.
- Octnet: Learning deep 3d representations at high resolutions. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017), 6620–6629.
- Diffusionnet: Discretization agnostic learning on surfaces. ACM Trans. Graph. 41, 3 (mar 2022).
- Deep learning 3d shape surfaces using geometry images. In ECCV (2016).
- Deeppano: Deep panoramic representation for 3-d shape recognition. IEEE Signal Processing Letters 22 (2015), 2339–2343.
- Multi-view convolutional neural networks for 3d shape recognition. 2015 IEEE International Conference on Computer Vision (ICCV) (2015), 945–953.
- Smirnov D., Solomon J. M.: Hodgenet: Learning spectral geometry on triangle meshes. ACM Trans. Graph. 40 (2021), 166:1–166:11.
- Instance normalization: The missing ingredient for fast stylization. ArXiv abs/1607.08022 (2016).
- Articulated mesh animation from multi-view silhouettes. ACM SIGGRAPH 2008 papers (2008).
- Active co-analysis of a set of shapes. ACM Transactions on Graphics (TOG) 31 (2012), 1 – 10.
- ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders. arXiv e-prints (Jan. 2023), arXiv:2301.00808.
- Cnns on surfaces using rotation-equivariant features. ACM Transactions on Graphics (TOG) 39 (2020), 92:1 – 92:12.
- Wu Y., He K.: Group Normalization. arXiv e-prints (Mar. 2018), arXiv:1803.08494.
- O-cnn. ACM Transactions on Graphics (TOG) 36 (2017), 1 – 11.
- Pointconv: Deep convolutional networks on 3d point clouds. In Proceedings of the IEEE/CVF Conference on computer vision and pattern recognition (2019), pp. 9621–9630.
- 3d shapenets: A deep representation for volumetric shapes. 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2015), 1912–1920.
- Dynamic graph cnn for learning on point clouds. ACM Transactions on Graphics (TOG) 38 (2019), 1 – 12.
- Adaptive O-CNN: A Patch-based Deep Representation of 3D Shapes. ACM Transactions on Graphics (SIGGRAPH Asia) 37, 6 (2018).
- Directionally convolutional networks for 3d shape segmentation. In 2017 IEEE International Conference on Computer Vision (ICCV) (2017), pp. 2717–2726.
- Paconv: Position adaptive convolution with dynamic kernel assembling on point clouds. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2021), pp. 3173–3182.
- Spidercnn: Deep learning on point sets with parameterized convolutional filters. In Proceedings of the European conference on computer vision (ECCV) (2018), pp. 87–102.
- A survey on deep geometry learning: From a representation perspective. Computational Visual Media 6 (2020), 113–133.
- Geometry sharing network for 3d point cloud classification and segmentation. In Proceedings of the AAAI Conference on Artificial Intelligence (2020), vol. 34, pp. 12500–12507.
- Intra: 3d intracranial aneurysm dataset for deep learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2020), pp. 2656–2666.
- 3d medical point transformer: Introducing convolution to attention networks for medical point cloud analysis. arXiv preprint arXiv:2112.04863 (2021).
- Adaptive graph convolution for point cloud analysis. In Proceedings of the IEEE/CVF international conference on computer vision (2021), pp. 4965–4974.