Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

SMERC: Social media event response clustering using textual and temporal information (1811.05063v1)

Published 13 Nov 2018 in cs.SI, cs.IR, physics.data-an, and stat.AP

Abstract: Tweet clustering for event detection is a powerful modern method to automate the real-time detection of events. In this work we present a new tweet clustering approach, using a probabilistic approach to incorporate temporal information. By analysing the distribution of time gaps between tweets we show that the gaps between pairs of related tweets exhibit exponential decay, whereas the gaps between unrelated tweets are approximately uniform. Guided by this insight, we use probabilistic arguments to estimate the likelihood that a pair of tweets are related, and build an improved clustering method. Our method Social Media Event Response Clustering (SMERC) creates clusters of tweets based on their tendency to be related to a single event. We evaluate our method at three levels: through traditional event prediction from tweet clustering, by measuring the improvement in quality of clusters created, and also comparing the clustering precision and recall with other methods. By applying SMERC to tweets collected during a number of sporting events, we demonstrate that incorporating temporal information leads to state of the art clustering performance.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Peter Mathews (3 papers)
  2. Caitlin Gray (4 papers)
  3. Lewis Mitchell (56 papers)
  4. Giang T. Nguyen (26 papers)
  5. Nigel G. Bean (6 papers)
Citations (2)

Summary

We haven't generated a summary for this paper yet.