Imbalanced Multi-label Classification for Business-related Text with Moderately Large Label Spaces (2306.07046v1)

Published 12 Jun 2023 in cs.IR and cs.LG

Abstract: In this study, we compared the performance of four different methods for multi label text classification using a specific imbalanced business dataset. The four methods we evaluated were fine tuned BERT, Binary Relevance, Classifier Chains, and Label Powerset. The results show that fine tuned BERT outperforms the other three methods by a significant margin, achieving high values of accuracy, F1 Score, Precision, and Recall. Binary Relevance also performs well on this dataset, while Classifier Chains and Label Powerset demonstrate relatively poor performance. These findings highlight the effectiveness of fine tuned BERT for multi label text classification tasks, and suggest that it may be a useful tool for businesses seeking to analyze complex and multifaceted texts.

Citations (2)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Imbalanced Multi-label Classification for Business-related Text with Moderately Large Label Spaces (2306.07046v1)

Summary

Related Papers