Label-Aware Distribution Calibration for Long-tailed Classification (2111.04901v1)

Published 9 Nov 2021 in cs.LG and cs.CV

Abstract: Real-world data usually present long-tailed distributions. Training on imbalanced data tends to make neural networks perform well on head classes but much worse on tail classes. The main challenge is the severe sparseness of training instances for the tail classes, which results in biased distribution estimation during training. Considerable effort has been devoted to mitigating this challenge, including data re-sampling and synthesizing new training instances for tail classes. However, no prior research has exploited transferable knowledge from head classes to calibrate the distribution of tail classes. In this paper, we suppose that tail classes can be enriched by similar head classes and propose a novel distribution calibration approach named Label-Aware Distribution Calibration (LADC). LADC transfers the statistics of relevant head classes to infer the distribution of tail classes. Sampling from the calibrated distribution further facilitates re-balancing the classifier. Experiments on both image and text long-tailed datasets demonstrate that LADC significantly outperforms existing methods. The visualization also shows that LADC provides a more accurate distribution estimation.
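
The idea of transferring head-class statistics and then sampling from the calibrated distribution can be illustrated with a rough sketch. The abstract does not specify how relevant head classes are chosen or how their statistics are combined, so everything below is an assumption made for illustration: features are modeled as Gaussian per class, relevance is measured by Euclidean distance between class means, and the function names (`class_stats`, `calibrate_tail`, `sample_tail`) and the parameters `k` and `alpha` are hypothetical. It is not the authors' implementation.

```python
import numpy as np

def class_stats(features, labels):
    """Return {class: (mean, covariance)} computed from feature vectors."""
    stats = {}
    dim = features.shape[1]
    for c in np.unique(labels):
        x = features[labels == c]
        mean = x.mean(axis=0)
        # Covariance is undefined for a single sample; fall back to zeros.
        cov = np.cov(x, rowvar=False) if len(x) > 1 else np.zeros((dim, dim))
        stats[int(c)] = (mean, cov)
    return stats

def calibrate_tail(stats, tail_class, head_classes, k=2, alpha=0.1):
    """Estimate a tail-class Gaussian from its k nearest head classes.

    Assumption: "relevant" means closest class mean in Euclidean distance.
    Assumption: statistics are combined by simple averaging.
    """
    tail_mean, _ = stats[tail_class]
    dists = [np.linalg.norm(stats[h][0] - tail_mean) for h in head_classes]
    nearest = [head_classes[i] for i in np.argsort(dists)[:k]]
    # Average the tail mean with the selected head-class means.
    means = np.stack([stats[h][0] for h in nearest] + [tail_mean])
    calibrated_mean = means.mean(axis=0)
    # Borrow covariance from the head classes; alpha adds a small spread.
    covs = np.stack([stats[h][1] for h in nearest])
    calibrated_cov = covs.mean(axis=0) + alpha * np.eye(len(tail_mean))
    return calibrated_mean, calibrated_cov, nearest

def sample_tail(mean, cov, n, rng):
    """Draw n synthetic tail-class features from the calibrated Gaussian."""
    return rng.multivariate_normal(mean, cov, size=n)
```

Under these assumptions, the sampled features would be added to the real tail-class features when re-training the classifier, so that each class contributes a comparable number of training points.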

Authors (7)
  1. Chaozheng Wang (28 papers)
  2. Shuzheng Gao (14 papers)
  3. Cuiyun Gao (97 papers)
  4. Pengyun Wang (14 papers)
  5. Wenjie Pei (56 papers)
  6. Lujia Pan (27 papers)
  7. Zenglin Xu (145 papers)
Citations (18)
