A Comprehensive Analysis of Adapter Efficiency (2305.07491v1)

Published 12 May 2023 in cs.CL

Abstract: Adapters have been positioned as a parameter-efficient fine-tuning (PEFT) approach, whereby a minimal number of parameters are added to the model and fine-tuned. However, adapters have not been sufficiently analyzed to understand if PEFT translates to benefits in training/deployment efficiency and maintainability/extensibility. Through extensive experiments on many adapters, tasks, and languages in supervised and cross-lingual zero-shot settings, we clearly show that for Natural Language Understanding (NLU) tasks, the parameter efficiency in adapters does not translate to efficiency gains compared to full fine-tuning of models. More precisely, adapters are relatively expensive to train and have slightly higher deployment latency. Furthermore, the maintainability/extensibility benefits of adapters can be achieved with simpler approaches like multi-task training via full fine-tuning, which also provide relatively faster training times. We, therefore, recommend that for moderately sized models for NLU tasks, practitioners should rely on full fine-tuning or multi-task training rather than using adapters. Our code is available at https://github.com/AI4Bharat/adapter-efficiency.
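For context on the method being analyzed: the sketch below shows a standard bottleneck adapter (in the style of Houlsby et al.), where a small down-project/up-project module with a residual connection is inserted into a frozen backbone and only the adapter parameters are trained. This is an illustrative assumption, not the paper's exact configuration; the hidden size, bottleneck width, and helper names are placeholders.

```python
import torch
import torch.nn as nn

class BottleneckAdapter(nn.Module):
    """Standard bottleneck adapter: down-project, nonlinearity, up-project,
    plus a residual connection. Only these parameters are fine-tuned."""

    def __init__(self, hidden_dim: int, bottleneck_dim: int = 64):
        super().__init__()
        self.down = nn.Linear(hidden_dim, bottleneck_dim)
        self.up = nn.Linear(bottleneck_dim, hidden_dim)
        self.act = nn.GELU()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Residual connection preserves the frozen backbone's representation.
        return x + self.up(self.act(self.down(x)))


def add_adapter_and_freeze(backbone: nn.Module, hidden_dim: int = 768) -> BottleneckAdapter:
    """Freeze all backbone parameters; only the adapter stays trainable.
    In practice one adapter is inserted after each transformer sub-layer;
    a single module is returned here just to illustrate the parameter count."""
    for p in backbone.parameters():
        p.requires_grad = False
    adapter = BottleneckAdapter(hidden_dim)
    trainable = sum(p.numel() for p in adapter.parameters())
    total = sum(p.numel() for p in backbone.parameters()) + trainable
    print(f"trainable adapter params: {trainable} / total params: {total}")
    return adapter
```

The paper's point is that this parameter saving does not automatically translate into faster training or lower deployment latency, since the frozen backbone is still active in every forward and backward pass.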

Authors (6)
  1. Nandini Mundra (2 papers)
  2. Sumanth Doddapaneni (16 papers)
  3. Raj Dabre (65 papers)
  4. Anoop Kunchukuttan (45 papers)
  5. Ratish Puduppully (20 papers)
  6. Mitesh M. Khapra (79 papers)
Citations (5)