Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Zero-Shot Multi-Label Topic Inference with Sentence Encoders (2304.07382v1)

Published 14 Apr 2023 in cs.CL and cs.IR

Abstract: Sentence encoders have indeed been shown to achieve superior performances for many downstream text-mining tasks and, thus, claimed to be fairly general. Inspired by this, we performed a detailed study on how to leverage these sentence encoders for the "zero-shot topic inference" task, where the topics are defined/provided by the users in real-time. Extensive experiments on seven different datasets demonstrate that Sentence-BERT demonstrates superior generality compared to other encoders, while Universal Sentence Encoder can be preferred when efficiency is a top priority.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Souvika Sarkar (10 papers)
  2. Dongji Feng (11 papers)
  3. Shubhra Kanti Karmaker Santu (17 papers)
Citations (2)