2000 character limit reached
Zero-Shot Multi-Label Topic Inference with Sentence Encoders (2304.07382v1)
Published 14 Apr 2023 in cs.CL and cs.IR
Abstract: Sentence encoders have indeed been shown to achieve superior performances for many downstream text-mining tasks and, thus, claimed to be fairly general. Inspired by this, we performed a detailed study on how to leverage these sentence encoders for the "zero-shot topic inference" task, where the topics are defined/provided by the users in real-time. Extensive experiments on seven different datasets demonstrate that Sentence-BERT demonstrates superior generality compared to other encoders, while Universal Sentence Encoder can be preferred when efficiency is a top priority.
- Souvika Sarkar (10 papers)
- Dongji Feng (11 papers)
- Shubhra Kanti Karmaker Santu (17 papers)