Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Point Contrastive Prediction with Semantic Clustering for Self-Supervised Learning on Point Cloud Videos (2308.09247v1)

Published 18 Aug 2023 in cs.CV and cs.AI

Abstract: We propose a unified point cloud video self-supervised learning framework for object-centric and scene-centric data. Previous methods commonly conduct representation learning at the clip or frame level and cannot well capture fine-grained semantics. Instead of contrasting the representations of clips or frames, in this paper, we propose a unified self-supervised framework by conducting contrastive learning at the point level. Moreover, we introduce a new pretext task by achieving semantic alignment of superpoints, which further facilitates the representations to capture semantic cues at multiple scales. In addition, due to the high redundancy in the temporal dimension of dynamic point clouds, directly conducting contrastive learning at the point level usually leads to massive undesired negatives and insufficient modeling of positive representations. To remedy this, we propose a selection strategy to retain proper negatives and make use of high-similarity samples from other instances as positive supplements. Extensive experiments show that our method outperforms supervised counterparts on a wide range of downstream tasks and demonstrates the superior transferability of the learned representations.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Xiaoxiao Sheng (4 papers)
  2. Zhiqiang Shen (172 papers)
  3. Gang Xiao (18 papers)
  4. Longguang Wang (48 papers)
  5. Yulan Guo (89 papers)
  6. HeHe Fan (46 papers)
Citations (5)

Summary

We haven't generated a summary for this paper yet.