ACM Transactions on the Web

Twitter User Geolocation Based on Location Feature Enhancement

Meng Zhang 1
Xiangyang Luo 1
Ning-Bo Huang 1
Yimin Liu 1
Shaoyong Du 1, 2
1
 
Henan Key Laboratory of Cyberspace Situation Awareness, Zhengzhou, China
2
 
State Key Laboratory of Information Security, Beijing, China
Publication typeJournal Article
Publication date2025-01-13
scimago Q2
wos Q2
SJR0.787
CiteScore4.9
Impact factor2.6
ISSN15591131, 1559114X
Abstract

User location discovery from social media is crucial for location-based services like emergency awareness and event monitoring. Existing approaches generally integrate user-generated text features and social relationships, but insufficiently explore location-specific features and geographically proximate relationships, leading to suboptimal accuracy. In this paper, we propose a Twitter user geolocation method based on location feature enhancement, to better capture the location characteristics in users’ tweets and social relationships. Specifically, a user tweet representation algorithm based on location feature separation (TwLS) is designed. By leveraging words’ location-aware weight matrix and pre-trained embeddings, TwLS calculates a tweet representation for each user in every location, explicitly indicating the relevance between users and various locations. Additionally, we develop the local celebrity discovery method (LocCel) to construct social networks by identifying and preserving geographically concentrated high-degree nodes while filtering noise. Thereby LocCel enhances local relationships and strengthens location-proximate connections within the user social network. Experiments on two real-world datasets show that our method outperforms seven baselines, improving user geolocation accuracy by 3.1% ∼ 8.1% and 1.8% ∼ 8.8%, while reducing median error by 22.2% ∼ 52.8% and 19.4% ∼ 50.7%, respectively.

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Share
Cite this
GOST | RIS | BibTex
Found error?