
The devil is in discretization discrepancy. Robustifying Differentiable NAS with Single-Stage Searching Protocol (2405.16610v1)

Published 26 May 2024 in cs.CV, cs.AI, cs.LG, and cs.NE

Abstract: Neural Architecture Search (NAS) has been widely adopted to design neural networks for various computer vision tasks. One of its most promising subdomains is differentiable NAS (DNAS), where the optimal architecture is found in a differentiable manner. However, gradient-based methods suffer from discretization error, which can severely damage the process of obtaining the final architecture. In our work, we first study the risk of discretization error and show how it affects an unregularized supernet. Then, we show that penalizing high entropy, a common technique of architecture regularization, can hinder the supernet's performance. Therefore, to robustify the DNAS framework, we introduce a novel single-stage searching protocol, which does not rely on decoding a continuous architecture. Our results demonstrate that this approach outperforms other DNAS methods by achieving 75.3% in the searching stage on the Cityscapes validation dataset and attains performance 1.1% higher than the optimal network of DCNAS on the non-dense search space comprising short connections. The entire training process takes only 5.5 GPU days owing to weight reuse, and yields a computationally efficient architecture. Additionally, we propose a new dataset split procedure, which substantially improves results and prevents architecture degeneration in DARTS.
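
To make the term "discretization error" concrete, here is a minimal toy sketch. It is not the paper's method or code. It assumes a single DARTS-style mixed edge whose candidate operations each produce one scalar, which is a simplification chosen only for illustration. All function names and numeric values are hypothetical. The sketch compares the softmax-weighted output used during search with the output after keeping only the top-weighted operation, and reports the entropy of the architecture weights.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of architecture logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy (in nats) of the architecture weights."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def mixed_output(probs, op_outputs):
    """Continuous relaxation: weighted sum of all candidate operations."""
    return sum(p * o for p, o in zip(probs, op_outputs))


def discretized_output(probs, op_outputs):
    """Decoding step: keep only the operation with the largest weight."""
    best = max(range(len(probs)), key=probs.__getitem__)
    return op_outputs[best]


def discretization_gap(logits, op_outputs):
    """Absolute difference between the relaxed and the decoded output."""
    probs = softmax(logits)
    return abs(mixed_output(probs, op_outputs) - discretized_output(probs, op_outputs))


# Hypothetical scalar outputs of three candidate operations on one edge.
op_outputs = [1.0, 2.0, 4.0]

# Two hypothetical settings of the architecture logits.
flat_logits = [0.0, 0.0, 0.1]    # near-uniform weights, high entropy
peaked_logits = [0.0, 0.0, 8.0]  # near one-hot weights, low entropy

for name, logits in [("flat", flat_logits), ("peaked", peaked_logits)]:
    probs = softmax(logits)
    print(name, "entropy:", entropy(probs), "gap:", discretization_gap(logits, op_outputs))
```

In this toy setting the near-uniform weights give a large gap between the relaxed and decoded outputs, while the near one-hot weights give a small one. That is the usual motivation for penalizing high entropy. The abstract's claim is that such a penalty can itself hinder the supernet, which this sketch does not model.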

