Predicting the geolocation of tweets using transformer models on customized data
Lutsai K, Lampert C. 2024. Predicting the geolocation of tweets using transformer models on customized data. Journal of Spatial Information Science. (29), 69–99.
Download
Journal Article
| Published
| English
Scopus indexed
Author
Lutsai, Kateryna;
Lampert , ChristophISTA 

Corresponding author has ISTA affiliation
Department
Abstract
This research is aimed to solve the tweet/user geolocation prediction task and provide a flexible methodology for the geo-tagging of textual big data. The suggested approach implements neural networks for natural language processing (NLP) to estimate the location as coordinate pairs (longitude, latitude) and two-dimensional Gaussian Mixture Models (GMMs). The scope of proposed models has been finetuned on a Twitter dataset using pretrained Bidirectional Encoder Representations from Transformers (BERT) as base models. Performance metrics show a median error of fewer than 30 km on a worldwide-level, and fewer than 15 km on the US-level datasets for the models trained and evaluated on text features of tweets' content and metadata context. Our source code and data are available at https://github.com/K4TEL/geo-twitter.git.
Publishing Year
Date Published
2024-12-26
Journal Title
Journal of Spatial Information Science
Publisher
University of Maine
Acknowledgement
The authors acknowledge the Institute of Science and Technology (ISTA) for their material support and for granting access to the Twitter database archive, which was essential for the research.
Issue
29
Page
69-99
eISSN
IST-REx-ID
Cite this
Lutsai K, Lampert C. Predicting the geolocation of tweets using transformer models on customized data. Journal of Spatial Information Science. 2024;(29):69-99. doi:10.5311/JOSIS.2024.29.295
Lutsai, K., & Lampert, C. (2024). Predicting the geolocation of tweets using transformer models on customized data. Journal of Spatial Information Science. University of Maine. https://doi.org/10.5311/JOSIS.2024.29.295
Lutsai, Kateryna, and Christoph Lampert. “Predicting the Geolocation of Tweets Using Transformer Models on Customized Data.” Journal of Spatial Information Science. University of Maine, 2024. https://doi.org/10.5311/JOSIS.2024.29.295.
K. Lutsai and C. Lampert, “Predicting the geolocation of tweets using transformer models on customized data,” Journal of Spatial Information Science, no. 29. University of Maine, pp. 69–99, 2024.
Lutsai K, Lampert C. 2024. Predicting the geolocation of tweets using transformer models on customized data. Journal of Spatial Information Science. (29), 69–99.
Lutsai, Kateryna, and Christoph Lampert. “Predicting the Geolocation of Tweets Using Transformer Models on Customized Data.” Journal of Spatial Information Science, no. 29, University of Maine, 2024, pp. 69–99, doi:10.5311/JOSIS.2024.29.295.
All files available under the following license(s):
Creative Commons Attribution 3.0 Unported (CC BY 3.0):
Main File(s)
File Name
Access Level

Date Uploaded
2025-01-20
MD5 Checksum
b82413f00398ffb5168e8e747571a98d