Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

AIO-P: Expanding Neural Performance Predictors Beyond Image Classification (2211.17228v2)

Published 30 Nov 2022 in cs.CV and cs.LG

Abstract: Evaluating neural network performance is critical to deep neural network design but a costly procedure. Neural predictors provide an efficient solution by treating architectures as samples and learning to estimate their performance on a given task. However, existing predictors are task-dependent, predominantly estimating neural network performance on image classification benchmarks. They are also search-space dependent; each predictor is designed to make predictions for a specific architecture search space with predefined topologies and set of operations. In this paper, we propose a novel All-in-One Predictor (AIO-P), which aims to pretrain neural predictors on architecture examples from multiple, separate computer vision (CV) task domains and multiple architecture spaces, and then transfer to unseen downstream CV tasks or neural architectures. We describe our proposed techniques for general graph representation, efficient predictor pretraining and knowledge infusion techniques, as well as methods to transfer to downstream tasks/spaces. Extensive experimental results show that AIO-P can achieve Mean Absolute Error (MAE) and Spearman's Rank Correlation (SRCC) below 1% and above 0.5, respectively, on a breadth of target downstream CV tasks with or without fine-tuning, outperforming a number of baselines. Moreover, AIO-P can directly transfer to new architectures not seen during training, accurately rank them and serve as an effective performance estimator when paired with an algorithm designed to preserve performance while reducing FLOPs.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (54)
  1. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems.
  2. Zero-Cost Proxies for Lightweight NAS. In International Conference on Learning Representations (ICLR).
  3. 2D Human Pose Estimation: New Benchmark and State of the Art Analysis. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
  4. UniPose: Unified Human Pose Estimation in Single Images and Videos. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 7035–7044.
  5. Once for All: Train One Network and Specialize it for Efficient Deployment. In International Conference on Learning Representations.
  6. ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware. In International Conference on Learning Representations.
  7. Generative Adversarial Neural Architecture Search. In Zhou, Z.-H., ed., Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, IJCAI-21, 2227–2234. International Joint Conferences on Artificial Intelligence Organization. Main Track.
  8. Encoder-Decoder With Atrous Separable Convolution For Semantic Image Segmentation. In Proceedings of the European Conference on Computer Vision (ECCV), 801–818.
  9. Chollet, F. 2017. Xception: Deep Learning with Depthwise Separable Convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 1251–1258.
  10. The Cityscapes Dataset for Semantic Urban Scene Understanding. In Proc. of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
  11. Learning Versatile Neural Architectures by Propagating Network Codes. In International Conference on Learning Representations.
  12. NAS-Bench-201: Extending the Scope of Reproducible Neural Architecture Search. In International Conference on Learning Representations.
  13. TransNAS-Bench-101: Improving Transferability and Generalizability of Cross-Task Neural Architecture Search. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 5251–5260.
  14. Neural Architecture Search: A Survey. J. Mach. Learn. Res., 20(55): 1–21.
  15. The Pascal Visual Object Classes Challenge: A Retrospective. International Journal of Computer Vision, 111(1): 98–136.
  16. Mask R-CNN. In Proceedings of the IEEE International Conference on Computer Vision, 2961–2969.
  17. Deep Residual Learning for Image Recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 770–778.
  18. Identity Mappings in Deep Residual Networks. In European Conference on Computer Vision, 630–645. Springer.
  19. Searching For MobileNetV3. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 1314–1324.
  20. Squeeze-and-Excitation Networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 7132–7141.
  21. Learning Effective Human Pose Estimation from Inaccurate Annotation. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 1465–1472. IEEE.
  22. Panoptic Feature Pyramid Networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 6399–6408.
  23. Learning Multiple Layers of Features From Tiny Images. Technical Report.
  24. Feature Pyramid Networks for Object Detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2117–2125.
  25. Microsoft COCO: Common Objects in Context. In Computer Vision – ECCV 2014, 740–755. Cham: Springer International Publishing.
  26. Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 82–92.
  27. DARTS: Differentiable Architecture Search. In International Conference on Learning Representations (ICLR).
  28. SGDR: Stochastic Gradient Descent with Warm Restarts. In 5th International Conference on Learning Representations, ICLR 2017, Toulon, France, April 24-26, 2017, Conference Track Proceedings.
  29. One Proxy Device is Enough for Hardware-Aware Neural Architecture Search. Proceedings of the ACM on Measurement and Analysis of Computing Systems, 5(3): 1–34.
  30. TNASP: A Transformer-based NAS Predictor with a Self-Evolution Framework. In Ranzato, M.; Beygelzimer, A.; Dauphin, Y.; Liang, P.; and Vaughan, J. W., eds., Advances in Neural Information Processing Systems, volume 34, 15125–15137. Curran Associates, Inc.
  31. Semi-Supervised Neural Architecture Search. Advances in Neural Information Processing Systems, 33: 10547–10557.
  32. L22{}^{2}start_FLOATSUPERSCRIPT 2 end_FLOATSUPERSCRIPTNAS: Learning to Optimize Neural Architectures via Continuous-Action Reinforcement Learning. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management, 1284–1293.
  33. Profiling Neural Blocks and Design Spaces for Mobile Neural Architecture Search. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management, 4026–4035.
  34. Exploring Neural Architecture Search Space via Deep Deterministic Sampling. IEEE Access, 9: 110962–110974.
  35. Weisfeiler and Leman Go Neural: Higher-Order Graph Neural Networks. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 33, 4602–4609.
  36. Automated Super-Network Generation for Scalable Neural Architecture Search. In International Conference on Automated Machine Learning, 5–1. PMLR.
  37. Efficient Neural Architecture Search via Parameters Sharing. In International Conference on Machine Learning, 4095–4104. PMLR.
  38. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. Advances in Neural Information Processing Systems, 28.
  39. ImageNet Large Scale Visual Recognition Challenge. International Journal of Computer Vision, 115(3): 211–252.
  40. Sergio Guadarrama, Nathan Silberman. 2016. TensorFlow-Slim: A Lightweight Library for Defining, Training and Evaluating Complex Models in TensorFlow.
  41. Inception-V4, Inception-ResNet and the Impact of Residual Connections on Learning. In Thirty-first AAAI Conference on Artificial Intelligence.
  42. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. In International Conference on Machine Learning, 6105–6114. PMLR.
  43. Pruning Neural Networks Without Any Data by Iteratively Conserving Synaptic Flow. Advances in Neural Information Processing Systems, 33: 6377–6389.
  44. K-Adapter: Infusing Knowledge into Pre-Trained Models with Adapters. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, 1405–1418. Online: Association for Computational Linguistics.
  45. Neural Predictor for Neural Architecture Search. In European Conference on Computer Vision, 660–676. Springer.
  46. BANANAS: Bayesian Optimization with Neural Architectures for Neural Architecture Search. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 35, 10293–10301.
  47. Stronger NAS with Weaker Predictors. Advances in Neural Information Processing Systems, 34: 28904–28918.
  48. Detectron2. https://github.com/facebookresearch/detectron2.
  49. NAS-Bench-101: Towards Reproducible Neural Architecture Search. In International Conference on Machine Learning, 7105–7114.
  50. Surrogate NAS Benchmarks: Going Beyond the Limited Search Spaces of Tabular NAS Benchmarks. In International Conference on Learning Representations.
  51. Deep Learning-Based Human Pose Estimation: A Survey.
  52. Scene Parsing Through ADE20K Dataset. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 5122–5130.
  53. Towards 3D Human Pose Estimation in the Wild: A Weakly-Supervised Approach. In The IEEE International Conference on Computer Vision (ICCV).
  54. Neural Architecture Search with Reinforcement Learning. In International Conference on Learning Representations.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Keith G. Mills (14 papers)
  2. Di Niu (67 papers)
  3. Mohammad Salameh (20 papers)
  4. Weichen Qiu (2 papers)
  5. Fred X. Han (10 papers)
  6. Puyuan Liu (3 papers)
  7. Jialin Zhang (87 papers)
  8. Wei Lu (325 papers)
  9. Shangling Jui (36 papers)
Citations (6)

Summary

We haven't generated a summary for this paper yet.