Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Comparison of SVM against Pre-trained Language Models (PLMs) for Text Classification Tasks (2211.02563v1)

Published 4 Nov 2022 in cs.CL and cs.LG

Abstract: The emergence of pre-trained LLMs (PLMs) has shown great success in many NLP tasks including text classification. Due to the minimal to no feature engineering required when using these models, PLMs are becoming the de facto choice for any NLP task. However, for domain-specific corpora (e.g., financial, legal, and industrial), fine-tuning a pre-trained model for a specific task has shown to provide a performance improvement. In this paper, we compare the performance of four different PLMs on three public domain-free datasets and a real-world dataset containing domain-specific words, against a simple SVM linear classifier with TFIDF vectorized text. The experimental results on the four datasets show that using PLMs, even fine-tuned, do not provide significant gain over the linear SVM classifier. Hence, we recommend that for text classification tasks, traditional SVM along with careful feature engineering can pro-vide a cheaper and superior performance than PLMs.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Yasmen Wahba (3 papers)
  2. Nazim Madhavji (5 papers)
  3. John Steinbacher (13 papers)
Citations (16)
X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets