Predicting the Geolocation of Tweets Using transformer models on Customized Data (2303.07865v6)

Published 14 Mar 2023 in cs.CL, cs.AI, cs.IR, and cs.LG

Abstract: This research aims to solve the tweet/user geolocation prediction task and to provide a flexible methodology for geotagging textual big data. The suggested approach uses neural networks for NLP to estimate location both as coordinate pairs (longitude, latitude) and as two-dimensional Gaussian Mixture Models (GMMs). The proposed models were fine-tuned on a Twitter dataset, using pretrained Bidirectional Encoder Representations from Transformers (BERT) as base models. Performance metrics show a median error of less than 30 km on the worldwide-level dataset and less than 15 km on the US-level dataset for models trained and evaluated on text features of tweet content and metadata context. Our source code and data are available at https://github.com/K4TEL/geo-twitter.git
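
The abstract reports performance as a median error in kilometres between predicted and true (longitude, latitude) pairs. The following is a minimal sketch of how such a metric could be computed, assuming great-circle (haversine) distance on a spherical Earth; the function names `haversine_km` and `median_error_km` and the radius constant are illustrative assumptions, not taken from the paper or its repository, and the authors' exact distance computation may differ.

```python
import math
from statistics import median

# Mean Earth radius in km (assumption of this sketch; the paper may use another value).
EARTH_RADIUS_KM = 6371.0


def haversine_km(lon1, lat1, lon2, lat2):
    """Great-circle distance in km between two points given in degrees."""
    lon1, lat1, lon2, lat2 = map(math.radians, (lon1, lat1, lon2, lat2))
    dlon = lon2 - lon1
    dlat = lat2 - lat1
    a = (math.sin(dlat / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin(dlon / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def median_error_km(predicted, actual):
    """Median haversine error over paired (longitude, latitude) tuples."""
    return median(haversine_km(*p, *t) for p, t in zip(predicted, actual))


if __name__ == "__main__":
    # Hypothetical predictions and ground truth, as (longitude, latitude) pairs.
    preds = [(0.0, 0.0), (0.0, 0.0), (0.0, 0.0)]
    truth = [(0.0, 0.0), (0.0, 1.0), (0.0, 2.0)]
    print(f"median error: {median_error_km(preds, truth):.2f} km")
```

The median is used rather than the mean because a few predictions that land on the wrong continent would otherwise dominate the score.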
