Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Point in the Right Direction: Vector Prediction for Spatially-aware Self-supervised Volumetric Representation Learning (2211.08533v1)

Published 15 Nov 2022 in cs.CV

Abstract: High annotation costs and limited labels for dense 3D medical imaging tasks have recently motivated an assortment of 3D self-supervised pretraining methods that improve transfer learning performance. However, these methods commonly lack spatial awareness despite its centrality in enabling effective 3D image analysis. More specifically, position, scale, and orientation are not only informative but also automatically available when generating image crops for training. Yet, to date, no work has proposed a pretext task that distills all key spatial features. To fulfill this need, we develop a new self-supervised method, VectorPOSE, which promotes better spatial understanding with two novel pretext tasks: Vector Prediction (VP) and Boundary-Focused Reconstruction (BFR). VP focuses on global spatial concepts (i.e., properties of 3D patches) while BFR addresses weaknesses of recent reconstruction methods to learn more effective local representations. We evaluate VectorPOSE on three 3D medical image segmentation tasks, showing that it often outperforms state-of-the-art methods, especially in limited annotation settings.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Yejia Zhang (12 papers)
  2. Pengfei Gu (20 papers)
  3. Nishchal Sapkota (10 papers)
  4. Hao Zheng (200 papers)
  5. Peixian Liang (12 papers)
  6. Danny Z. Chen (72 papers)
Citations (2)

Summary

We haven't generated a summary for this paper yet.