
Adaptive Label Smoothing To Regularize Large-Scale Graph Training (2108.13555v1)

Published 30 Aug 2021 in cs.LG and cs.AI

Abstract: Graph neural networks (GNNs), which learn node representations by recursively aggregating information from each node's neighbors, have become a predominant computational tool in many domains. To handle large-scale graphs, most existing methods partition the input graph into multiple sub-graphs (e.g., through node clustering) and apply batch training to reduce memory cost. However, such batch training leads to label bias within each batch, which in turn results in over-confident model predictions. Because connected nodes with positively related labels tend to be assigned to the same batch, traditional cross-entropy minimization focuses on the predictions of the biased classes in the batch and may intensify overfitting. To overcome the label bias problem, we propose the adaptive label smoothing (ALS) method, which replaces the one-hot hard labels with smoothed ones and learns to reallocate label confidence from the biased classes to the others. Specifically, ALS propagates node labels to aggregate the neighborhood label distribution in a pre-processing step, and then updates the optimal smoothed labels online to adapt to the specific graph structure. Experiments on real-world datasets demonstrate that ALS can be generally applied to the main scalable learning frameworks to calibrate the biased labels and improve generalization performance.
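
The two steps named in the abstract (propagating labels to get a neighborhood label distribution, then training against smoothed labels) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the row-normalized adjacency with self-loops, the hop count, and the fixed mixing weight `alpha` are all assumptions made for illustration. In particular, the abstract says ALS updates the smoothed labels online during training, whereas this sketch uses a constant `alpha`, and it assumes every node's label is known.

```python
import numpy as np

def propagate_labels(adj, labels, num_classes, num_hops=2):
    """Average one-hot labels over neighbors for `num_hops` rounds.

    `adj` is a dense (n, n) adjacency matrix. Self-loops and row
    normalization are illustrative choices, not taken from the paper.
    """
    n = adj.shape[0]
    one_hot = np.eye(num_classes)[labels]
    a_hat = adj + np.eye(n)                            # add self-loops
    a_hat = a_hat / a_hat.sum(axis=1, keepdims=True)   # row-normalize
    dist = one_hot.copy()
    for _ in range(num_hops):
        dist = a_hat @ dist                            # aggregate neighbor labels
    return one_hot, dist

def smooth_labels(one_hot, neighborhood_dist, alpha=0.1):
    """Mix hard labels with the neighborhood distribution.

    A fixed `alpha` is a simplification; the abstract describes the
    smoothed labels as being learned and updated online.
    """
    return (1.0 - alpha) * one_hot + alpha * neighborhood_dist

def soft_cross_entropy(logits, soft_targets):
    """Cross-entropy between model logits and soft target distributions."""
    shifted = logits - logits.max(axis=1, keepdims=True)   # numerical stability
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return float(-(soft_targets * log_probs).sum(axis=1).mean())

# Toy usage: a 3-node path graph with labels [0, 0, 1].
adj = np.array([[0, 1, 0], [1, 0, 1], [0, 1, 0]], dtype=float)
labels = np.array([0, 0, 1])
one_hot, dist = propagate_labels(adj, labels, num_classes=2, num_hops=1)
soft = smooth_labels(one_hot, dist, alpha=0.1)
loss = soft_cross_entropy(np.zeros((3, 2)), soft)
```

Because both the one-hot labels and the propagated distribution are row-stochastic, each smoothed label row still sums to 1, so it remains a valid target for cross-entropy.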

Authors (8)
  1. Kaixiong Zhou (52 papers)
  2. Ninghao Liu (98 papers)
  3. Fan Yang (878 papers)
  4. Zirui Liu (58 papers)
  5. Rui Chen (310 papers)
  6. Li Li (657 papers)
  7. Soo-Hyun Choi (7 papers)
  8. Xia Hu (186 papers)
Citations (17)
