Geolocation Detection Approaches for User Discussion Analysis in Twitter

Cited by: 2
Authors
Blekanov, Ivan [1]
Maksimov, Alexey [1]
Nepiyushchikh, Dmitry [1]
Bodrunova, Svetlana S. [1]
Affiliations
[1] St Petersburg State Univ, St Petersburg 199034, Russia
Source
HCI INTERNATIONAL 2022 - LATE BREAKING PAPERS: INTERACTION IN NEW MEDIA, LEARNING AND GAMES | 2022 / Vol. 13517
Funding
Russian Science Foundation
Keywords
Social network analysis; Geolocation detection; Twitter users discussion; Open street map service; Name entity recognition model; User graph analysis;
DOI
10.1007/978-3-031-22131-6_2
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this research, the authors consider methods for identifying the geodata of social network users within user discussions. Knowing users' geolocation makes it possible to analyze how a discussion spreads among users in different countries. The authors do not try to determine the exact geolocation, but rather the country where the users are located. The difficulty of obtaining country-level location data lies in the fact that a high percentage of users do not state their location correctly, either describing it humorously or not stating it at all. There are various methods of obtaining user location data, among them text-based methods, methods based on analysis of the context, and methods based on the topology of the user graph. In this paper, we place special emphasis on a method that reveals the geodata of users who specified it incorrectly or did not specify it at all. To test our method, we use Twitter datasets. We propose several approaches to resolve the issues stated above. The paper highlights three approaches: the naive approach, the naive approach using natural language processing (NLP), and the graph approach, which is glossary-based and determines the number of outgoing connections. We introduce two measures to evaluate the proposed approaches, Recall-GEO and Precision-GEO, which are described in the paper. The accuracy of the UserGraph method is finally evaluated using these measures.
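The naive and graph approaches named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the glossary contents, the function names `naive_country` and `graph_country`, and the majority-vote rule over outgoing connections are all assumptions made for illustration, since the abstract does not specify these details.

```python
import re
from collections import Counter

# Hypothetical mini-glossary: lowercase place names mapped to country codes.
GLOSSARY = {
    "russia": "RU", "moscow": "RU", "st petersburg": "RU",
    "germany": "DE", "berlin": "DE",
    "usa": "US", "new york": "US",
}


def naive_country(location):
    """Return a country code if a glossary entry occurs as a whole word
    (or phrase) in the free-text profile location, else None."""
    if not location:
        return None
    text = location.lower()
    for place, code in GLOSSARY.items():
        # Word boundaries avoid false hits such as "usa" inside "jerusalem".
        if re.search(r"\b" + re.escape(place) + r"\b", text):
            return code
    return None


def graph_country(user, outgoing, known):
    """Assumed rule: pick the most common already-known country among the
    accounts this user has outgoing connections to, else None."""
    votes = Counter(known[v] for v in outgoing.get(user, ()) if v in known)
    return votes.most_common(1)[0][0] if votes else None


# Toy data: "c" gives a humorous location, "d" gives none.
profiles = {"a": "Moscow, Russia", "b": "Berlin", "e": "St Petersburg",
            "c": "somewhere over the rainbow", "d": ""}
known = {}
for name, loc in profiles.items():
    code = naive_country(loc)
    if code is not None:
        known[name] = code
outgoing = {"c": ["a", "b", "e"], "d": []}
print(graph_country("c", outgoing, known))
```

In this toy setup, users whose profile location matches the glossary are resolved directly, and the remaining users fall back to the countries of the accounts they connect to.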
Pages: 16-29
Page count: 14