
NAS-Bench-NLP: Neural Architecture Search Benchmark for Natural Language Processing (2006.07116v1)

Published 12 Jun 2020 in cs.LG, cs.CL, and stat.ML

Abstract: Neural Architecture Search (NAS) is a promising and rapidly evolving research area. Training a large number of neural networks requires an exceptional amount of computational power, which makes NAS unreachable for researchers who have limited or no access to high-performance clusters and supercomputers. A few benchmarks with precomputed neural architecture performances have recently been introduced to overcome this problem and ensure more reproducible experiments. However, these benchmarks cover only the computer vision domain and, thus, are built from image datasets and convolution-derived architectures. In this work, we step outside the computer vision domain by leveraging the language modeling task, which is the core of NLP. Our main contributions are as follows: we have provided a search space of recurrent neural networks on text datasets and trained 14k architectures within it; we have conducted both intrinsic and extrinsic evaluation of the trained models using datasets for semantic relatedness and language understanding evaluation; finally, we have tested several NAS algorithms to demonstrate how the precomputed results can be utilized. We believe that our results have high potential for use by both the NAS and NLP communities.
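
The key idea behind a tabular benchmark like this is that a NAS algorithm can replace expensive training runs with lookups into a table of precomputed results. The sketch below illustrates that pattern with random search over a toy table; the dictionary, key names, and perplexity values are all hypothetical placeholders, not the actual NAS-Bench-NLP data format, which stores full training logs for the ~14k trained RNN architectures.

```python
import random

# Hypothetical precomputed table mapping architecture identifiers to final
# validation perplexity. A real tabular NAS benchmark would load these
# scores from released training logs instead of generating them randomly.
precomputed = {f"arch_{i:05d}": random.uniform(80.0, 200.0) for i in range(14000)}

def random_search(table, budget=100, seed=0):
    """Evaluate `budget` architectures chosen at random and return the best.

    Each "evaluation" is a constant-time table lookup rather than hours of
    RNN training, which is what makes precomputed benchmarks cheap to use
    for comparing NAS algorithms.
    """
    rng = random.Random(seed)
    candidates = rng.sample(sorted(table), budget)
    best = min(candidates, key=table.__getitem__)
    return best, table[best]

if __name__ == "__main__":
    arch, ppl = random_search(precomputed, budget=100)
    print(f"best of 100 sampled architectures: {arch} (val perplexity {ppl:.1f})")
```

More sophisticated NAS algorithms (evolutionary search, Bayesian optimization) plug into the same loop: only the strategy for proposing the next candidate changes, while the cost of each query stays a table lookup.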

Authors (6)
  1. Nikita Klyuchnikov (10 papers)
  2. Ilya Trofimov (16 papers)
  3. Ekaterina Artemova (53 papers)
  4. Mikhail Salnikov (11 papers)
  5. Maxim Fedorov (3 papers)
  6. Evgeny Burnaev (189 papers)
Citations (96)
