Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Relational Prompt-based Pre-trained Language Models for Social Event Detection (2404.08263v2)

Published 12 Apr 2024 in cs.CL, cs.AI, cs.LG, and cs.SI

Abstract: Social Event Detection (SED) aims to identify significant events from social streams, and has a wide application ranging from public opinion analysis to risk management. In recent years, Graph Neural Network (GNN) based solutions have achieved state-of-the-art performance. However, GNN-based methods often struggle with missing and noisy edges between messages, affecting the quality of learned message embedding. Moreover, these methods statically initialize node embedding before training, which, in turn, limits the ability to learn from message texts and relations simultaneously. In this paper, we approach social event detection from a new perspective based on Pre-trained LLMs (PLMs), and present RPLM_SED (Relational prompt-based Pre-trained LLMs for Social Event Detection). We first propose a new pairwise message modeling strategy to construct social messages into message pairs with multi-relational sequences. Secondly, a new multi-relational prompt-based pairwise message learning mechanism is proposed to learn more comprehensive message representation from message pairs with multi-relational prompts using PLMs. Thirdly, we design a new clustering constraint to optimize the encoding process by enhancing intra-cluster compactness and inter-cluster dispersion, making the message representation more distinguishable. We evaluate the RPLM_SED on three real-world datasets, demonstrating that the RPLM_SED model achieves state-of-the-art performance in offline, online, low-resource, and long-tail distribution scenarios for social event detection tasks.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (87)
  1. Charu C Aggarwal and Karthik Subbian. 2012. Event detection in social streams. In Proceedings of the 2012 SIAM international conference on data mining. 624–635.
  2. Alaa Alharbi and Mark Lee. 2021. Kawarith: an Arabic Twitter corpus for crisis events. In Proceedings of the Sixth Arabic Natural Language Processing Workshop. Association for Computational Linguistics, 42–52.
  3. Hadi Amiri and Hal Daume III. 2016. Short text representation for detecting churn in microblogs. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 30. 1–7.
  4. A robust self-learning method for fully unsupervised cross-lingual mappings of word embeddings. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, 789–798.
  5. Farzindar Atefeh and Wael Khreich. 2015. A survey of techniques for event detection in twitter. Computational Intelligence 31, 1 (2015), 132–164.
  6. Beyond trending topics: Real-world event identification on twitter. In Proceedings of the international AAAI conference on web and social media, Vol. 5. 438–441.
  7. Latent dirichlet allocation. Journal of machine Learning research 3, Jan (2003), 993–1022.
  8. Enriching Word Vectors with Subword Information. Transactions of the Association for Computational Linguistics 5 (2017), 135–146.
  9. Density-based clustering based on hierarchical density estimates. In Pacific-Asia conference on knowledge discovery and data mining. Springer, 160–172.
  10. Knowledge-preserving incremental social event detection via heterogeneous gnns. In Proceedings of the Web Conference 2021. 3383–3395.
  11. Hierarchical and incremental structural entropy minimization for unsupervised social event detection. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 38. 8255–8264.
  12. Btm: Topic modeling over short texts. IEEE Transactions on Knowledge and Data Engineering 26, 12 (2014), 2928–2941.
  13. Mário Cordeiro. 2012. Twitter event detection: combining wavelet analysis and topic inference summarization. In Doctoral symposium on informatics engineering, Vol. 1. 11–16.
  14. Mário Cordeiro and João Gama. 2016. Online social networks event detection: a survey. Solving Large Scale Learning Tasks. Challenges and Algorithms: Essays Dedicated to Katharina Morik on the Occasion of Her 60th Birthday (2016), 1–41.
  15. MVGAN: Multi-view graph attention network for social event detection. ACM Transactions on Intelligent Systems and Technology (TIST) 12, 3 (2021), 1–24.
  16. Eliciting structural and semantic global knowledge in unsupervised graph contrastive learning. In Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence and Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence and Thirteenth Symposium on Educational Advances in Artificial Intelligence. 7378–7386.
  17. OpenPrompt: An Open-source Framework for Prompt-learning. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: System Demonstrations. Association for Computational Linguistics, 105–113.
  18. A density-based algorithm for discovering clusters in large spatial databases with noise. In Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, Vol. 96. AAAI Press, 226–231.
  19. Normalized mutual information feature selection. IEEE Transactions on neural networks 20, 2 (2009), 189–201.
  20. Real-time event detection on social data streams. In Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining. 2774–2782.
  21. STREAMCUBE: Hierarchical spatio-temporal hashtag clustering for event exploration over the Twitter stream. In 2015 IEEE 31st international conference on data engineering. IEEE, 1561–1572.
  22. Parameter free bursty events detection in text streams. In Proceedings of the 31st international conference on Very large data bases. ACM, 181–192.
  23. Anuradha Goswami and Ajey Kumar. 2016. A survey of event detection techniques in online social networks. Social Network Analysis and Mining 6 (2016), 1–25.
  24. PPT: Pre-trained Prompt Tuning for Few-shot Learning. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 8410–8423.
  25. Cross modal distillation for supervision transfer. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2827–2836.
  26. In defense of the triplet loss for person re-identification. arXiv preprint arXiv:1703.07737 (2017), 1–15.
  27. LoRA: Low-Rank Adaptation of Large Language Models. In International Conference on Learning Representations. 1–9.
  28. Adaptive online event detection in news streams. Knowledge-Based Systems 138 (2017), 105–112.
  29. Jacob Devlin Ming-Wei Chang Kenton and Lee Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics. Association for Computational Linguistics, 4171–4186.
  30. Structured Attention Networks. In International Conference on Learning Representations. 1–21.
  31. Thomas N Kipf and Max Welling. 2016. Semi-Supervised Classification with Graph Convolutional Networks. In International Conference on Learning Representations. 1–12.
  32. Word translation without parallel data. In International Conference on Learning Representations. 1–14.
  33. The Power of Scale for Parameter-Efficient Prompt Tuning. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 3045–3059.
  34. BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 1–10.
  35. Three-dimensional gene map of cancer cell types: Structural entropy minimisation principle for defining tumour subtypes. Scientific reports 6, 1 (2016), 20412.
  36. Learning to Transfer Prompts for Text Generation. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 3506–3518.
  37. A survey on deep learning event extraction: Approaches and applications. IEEE Transactions on Neural Networks and Learning Systems (2022), 1–22.
  38. Story forest: Extracting events and telling stories from breaking news. ACM Transactions on Knowledge Discovery from Data (TKDD) 14, 3 (2020), 1–28.
  39. Deep learning for community detection: progress, challenges and opportunities. In Proceedings of the Twenty-Ninth International Conference on International Joint Conferences on Artificial Intelligence. 4981–4987.
  40. Event detection via gated multilingual attention mechanism. In Proceedings of the AAAI conference on artificial intelligence, Vol. 32. 4865–4872.
  41. Pre-train, prompt, and predict: A systematic survey of prompting methods in natural language processing. Comput. Surveys 55, 9 (2023), 1–35.
  42. Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692 (2019), 1–13.
  43. Event detection and evolution in multi-lingual social streams. Frontiers of Computer Science 14 (2020), 1–15.
  44. Towards unsupervised deep graph structure learning. In Proceedings of the ACM Web Conference 2022. 1392–1403.
  45. Event early embedding: Predicting event volume dynamics at early stage. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval. 997–1000.
  46. Tweet insights: a visualization platform to extract temporal insights from twitter. arXiv preprint arXiv:2308.02142 (2023), 1–10.
  47. James MacQueen. 1967. Some methods for classification and analysis of multivariate observations. 1, 14 (1967), 281–297.
  48. Fabrizio Marozzo and Alessandro Bessi. 2018. Analyzing polarization of social media users and news sites during political campaigns. Social Network Analysis and Mining 8 (2018), 1–13.
  49. A French Corpus for Event Detection on Twitter. In Proceedings of the Twelfth Language Resources and Evaluation Conference. 6220–6227.
  50. Building a large-scale corpus for evaluating event detection on twitter. In Proceedings of the 22nd ACM international conference on Information & Knowledge Management. ACM, 409–418.
  51. Efficient estimation of word representations in vector space. In International Conference on Learning Representations. 1–12.
  52. LNMap: Departures from Isomorphic Assumption in Bilingual Lexicon Induction Through Non-Linear Mapping in Latent Space. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, 2712–2723.
  53. SEDTWik: segmentation-based event detection from tweets using Wikipedia. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop. Association for Computational Linguistics, 77–85.
  54. Tahir M Nisar and Man Yeung. 2018. Twitter as a tool for forecasting stock market movements: A short-window event study. The journal of finance and data science 4, 2 (2018), 101–119.
  55. Incremental clustering with vector expansion for online event detection in microblogs. Social Network Analysis and Mining 7 (2017), 1–17.
  56. Fine-grained event categorization with heterogeneous graph convolutional networks. In Proceedings of the 28th International Joint Conference on Artificial Intelligence. 3238–3245.
  57. Large-scale hierarchical text classification with recursively regularized deep graph-cnn. In Proceedings of the 2018 world wide web conference. 1063–1072.
  58. Streaming social event detection and evolution discovery in heterogeneous information networks. ACM Transactions on Knowledge Discovery from Data (TKDD) 15, 5 (2021), 1–33.
  59. Reinforced, incremental and cross-lingual event detection from social messages. IEEE Transactions on Pattern Analysis and Machine Intelligence 45, 1 (2022), 980–998.
  60. Prompt Based Tri-Channel Graph Convolution Neural Network for Aspect Sentiment Triplet Extraction. In Proceedings of the 2024 SIAM International Conference on Data Mining (SDM). 1–9.
  61. Glove: Global vectors for word representation. In Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP). 1532–1543.
  62. Language Models as Knowledge Bases?. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). Association for Computational Linguistics, 2463–2473.
  63. Improving language understanding by generative pre-training. (2018), 1–12.
  64. From known to unknown: quality-aware self-improving graph neural network for open set social event detection. In Proceedings of the 31st ACM International Conference on Information & Knowledge Management. 1696–1705.
  65. Evidential temporal-aware graph-based social event detection via dempster-shafer theory. In 2022 IEEE International Conference on Web Services (ICWS). IEEE, 331–336.
  66. Uncertainty-guided Boundary Learning for Imbalanced Social Event Detection. IEEE Transactions on Knowledge and Data Engineering (2023), 1–15.
  67. Transferring knowledge distillation for multilingual social event detection. arXiv preprint arXiv:2108.03084 (2021), 1–31.
  68. Event detection based on open information extraction and ontology. Journal of Information and Telecommunication 4, 3 (2020), 383–403.
  69. Kari Sentz and Scott Ferson. 2002. Combination of evidence in Dempster-Shafer theory. (2002), 1–96.
  70. TIARA: Multi-grained Retrieval for Robust Question Answering over Large Knowledge Base. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. 8108–8121.
  71. A comprehensive survey on community detection with deep learning. IEEE Transactions on Neural Networks and Learning Systems (2022), 1–29.
  72. How to fine-tune bert for text classification?. In Chinese computational linguistics: 18th China national conference, CCL 2019, Kunming, China, October 18–20, 2019, proceedings 18. Springer, 194–206.
  73. Attention is all you need. Advances in neural information processing systems 30, 5998–6008.
  74. Information theoretic measures for clusterings comparison: is a correction for chance necessary? (2009), 1073–1080.
  75. Zhongqing Wang and Yue Zhang. 2017. A neural model for joint event detection and summarization. In Proceedings of the 26th International Joint Conference on Artificial Intelligence. 4158–4164.
  76. MenatQA: A New Dataset for Testing the Temporal Comprehension and Reasoning Abilities of Large Language Models. In Findings of the Association for Computational Linguistics: EMNLP 2023. 1434–1447.
  77. Topicsketch: Real-time bursty topic detection from twitter. IEEE Transactions on Knowledge and Data Engineering 28, 8 (2016), 2216–2229.
  78. Hashtag-based sub-event discovery using mutually generative lda in twitter. In Proceedings of the AAAI conference on artificial intelligence, Vol. 30. 2666––2672.
  79. A probabilistic model for bursty topic discovery in microblogs. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 29. AAAI Press, 353–359.
  80. GraphFormers: GNN-nested Transformers for Representation Learning on Textual Graph. In Advances in Neural Information Processing Systems, Vol. 34. Curran Associates, Inc., 28798–28810.
  81. Ring: Real-time emerging anomaly monitoring system over text streams. IEEE Transactions on Big Data 5, 4 (2017), 506–519.
  82. Triovecevent: Embedding-based online local event detection in geo-tagged tweet streams. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 595–604.
  83. New event detection based on indexing-tree and named entity. In Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval. 215–222.
  84. TwHIN-BERT: A socially-enriched pre-trained language model for multilingual tweet representations at twitter. In Proceedings of the 29th ACM SIGKDD conference on knowledge discovery and data mining. ACM, 5597–5607.
  85. Zizhuo Zhang and Bang Wang. 2023. Prompt learning for news recommendation. In Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval. 227–237.
  86. Comparing twitter and traditional media using topic models. In Advances in Information Retrieval: 33rd European Conference on IR Research, ECIR 2011, Dublin, Ireland, April 18-21, 2011. Proceedings 33. Springer, 338–349.
  87. Xiangmin Zhou and Lei Chen. 2014. Event detection over twitter social media streams. The VLDB journal 23, 3 (2014), 381–400.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Pu Li (35 papers)
  2. Xiaoyan Yu (22 papers)
  3. Hao Peng (291 papers)
  4. Yantuan Xian (2 papers)
  5. Linqin Wang (2 papers)
  6. Li Sun (135 papers)
  7. Jingyun Zhang (14 papers)
  8. Philip S. Yu (592 papers)
Citations (2)

Summary

We haven't generated a summary for this paper yet.