
DSPoint: Dual-scale Point Cloud Recognition with High-frequency Fusion (2111.10332v4)

Published 19 Nov 2021 in cs.CV, cs.AI, and cs.LG

Abstract: Point cloud processing is a challenging task due to the sparsity and irregularity of point clouds. Prior works introduce delicate designs for either the local feature aggregator or the global geometric architecture, but few combine both advantages. We propose Dual-Scale Point Cloud Recognition with High-frequency Fusion (DSPoint) to extract local-global features by concurrently operating on voxels and points. We reverse the conventional design of applying convolution to voxels and attention to points. Specifically, we disentangle point features along the channel dimension for dual-scale processing: one part by point-wise convolution for fine-grained geometry parsing, the other by voxel-wise global attention for long-range structural exploration. We design a co-attention fusion module for feature alignment to blend local-global modalities, which conducts inter-scale cross-modality interaction by communicating high-frequency coordinate information. Experiments and ablations on the widely adopted ModelNet40, ShapeNet, and S3DIS datasets demonstrate the state-of-the-art performance of our DSPoint.
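
The channel split described in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation: the function name `dual_scale_block`, the grid size, the random weights, and the half-and-half channel split are all assumptions made for illustration. The local branch uses a shared per-point linear map as a stand-in for point-wise convolution, the global branch averages features into voxels and runs plain self-attention over the non-empty voxels, and the co-attention fusion with high-frequency coordinates is replaced here by simple concatenation.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def dual_scale_block(xyz, feats, grid=4, rng=None):
    """Hypothetical dual-scale block (illustrative only).

    xyz:   (N, 3) point coordinates, assumed normalized to [0, 1).
    feats: (N, C) point features, C even.
    Returns (N, C) features: [local half | global half].
    """
    rng = np.random.default_rng(0) if rng is None else rng
    n, c = feats.shape
    half = c // 2

    # Split channels: one half per scale (assumed 50/50 split).
    local_in, global_in = feats[:, :half], feats[:, half:]
    d = c - half

    # Local branch: shared per-point linear map + ReLU,
    # a stand-in for point-wise convolution.
    w_local = rng.standard_normal((half, half)) / np.sqrt(half)
    local_out = np.maximum(local_in @ w_local, 0.0)

    # Global branch, step 1: voxelize by averaging features per cell.
    cell = np.clip((xyz * grid).astype(int), 0, grid - 1)
    flat = cell[:, 0] * grid * grid + cell[:, 1] * grid + cell[:, 2]
    voxel_ids, inverse = np.unique(flat, return_inverse=True)
    inverse = inverse.reshape(-1)
    v = len(voxel_ids)
    voxel_feats = np.zeros((v, d))
    np.add.at(voxel_feats, inverse, global_in)
    voxel_feats /= np.bincount(inverse, minlength=v)[:, None]

    # Global branch, step 2: self-attention over non-empty voxels.
    wq, wk, wv = (rng.standard_normal((d, d)) / np.sqrt(d) for _ in range(3))
    q, k, val = voxel_feats @ wq, voxel_feats @ wk, voxel_feats @ wv
    attn = softmax(q @ k.T / np.sqrt(d), axis=-1)
    voxel_out = attn @ val

    # Global branch, step 3: copy each voxel's feature back to its points.
    global_out = voxel_out[inverse]

    # Simplified fusion: concatenate (the paper uses co-attention instead).
    return np.concatenate([local_out, global_out], axis=1)


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    pts = rng.random((128, 3))
    f = rng.standard_normal((128, 16))
    print(dual_scale_block(pts, f).shape)  # (128, 16)
```

In this sketch, points that fall into the same voxel receive identical global features, while their local features remain distinct; blending the two scales more carefully is the role the abstract assigns to the co-attention fusion module.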

Authors (6)
  1. Renrui Zhang (100 papers)
  2. Ziyao Zeng (12 papers)
  3. Ziyu Guo (49 papers)
  4. Xinben Gao (1 paper)
  5. Kexue Fu (23 papers)
  6. Jianbo Shi (57 papers)
Citations (24)