News Meets Microblog: Hashtag Annotation via Retriever-Generator (2104.08723v1)
Published 18 Apr 2021 in cs.CL
Abstract: Hashtag annotation for microblog posts has recently been formulated as a sequence generation problem to handle emerging hashtags that are unseen in the training set. The state-of-the-art method leverages conversations initiated by posts to enrich the contextual information of the short posts. However, it is unrealistic to assume that such conversations exist before the post is annotated with hashtags. Therefore, we propose to leverage news articles published before the microblog post to generate hashtags, following a Retriever-Generator framework. Extensive experiments on English Twitter datasets demonstrate superior performance and significant advantages of leveraging news articles to generate hashtags.
- Xiuwen Zheng
- Dheeraj Mekala
- Amarnath Gupta
- Jingbo Shang