Geolocation Detection Approaches for User Discussion Analysis in Twitter

Cited by: 2
Authors
Blekanov, Ivan [1]
Maksimov, Alexey [1]
Nepiyushchikh, Dmitry [1]
Bodrunova, Svetlana S. [1]
Affiliations
[1] St Petersburg State Univ, St Petersburg 199034, Russia
Funding
Russian Science Foundation
Keywords
Social network analysis; Geolocation detection; Twitter users discussion; Open street map service; Name entity recognition model; User graph analysis;
DOI
10.1007/978-3-031-22131-6_2
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this research, the authors consider methods for identifying geodata of users of social networks within user discussions. Knowledge of user geolocation data makes it possible to analyze the spread of discussion among users of different countries. The authors do not try to determine the exact geolocation, but rather the country where the users are located. The problem of getting country-level user location data lies in the fact that a high percentage of users do not state their location correctly, either mentioning it in humorous ways or not stating it at all. There are various methods of obtaining data about the location of users. Among them are text-based methods, methods based on the analysis of the context, and methods based on the topology of the user graph. In this paper, we place special emphasis on a method that makes it possible to reveal geodata of users who specified their geodata incorrectly or did not specify it at all. To test our method, we use Twitter datasets. We propose several approaches to resolve the issues stated above. The paper highlights three approaches: the naive approach, the naive approach using natural language processing (NLP), and the graph approach, which is glossary-based and determines the number of outgoing connections. We have introduced two measures to evaluate the proposed approaches, Recall-GEO and Precision-GEO, which are described throughout the paper. The accuracy of the UserGraph method is finally evaluated using these metrics.
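The abstract only names the approaches, so the following is a minimal illustrative sketch and not the authors' implementation. It assumes that the naive approach matches the profile location field against a country glossary, and that the graph approach assigns an unlabeled user the most frequent country among the accounts they link to. The glossary contents and the names `GLOSSARY`, `naive_country`, and `graph_country` are hypothetical.

```python
import re
from collections import Counter

# Hypothetical glossary of place names mapped to countries (illustrative only).
GLOSSARY = {
    "russia": "Russia",
    "st petersburg": "Russia",
    "moscow": "Russia",
    "germany": "Germany",
    "berlin": "Germany",
    "usa": "United States",
    "new york": "United States",
}


def naive_country(location_field):
    """Return the country of the first glossary term found as a whole word
    in the profile location field, or None if nothing matches."""
    if not location_field:
        return None
    text = location_field.lower()
    for term, country in GLOSSARY.items():
        # Word boundaries avoid false hits such as "usa" inside "jerusalem".
        if re.search(r"\b" + re.escape(term) + r"\b", text):
            return country
    return None


def graph_country(user, edges, known):
    """Assumed graph heuristic: majority vote over the countries of the
    accounts that `user` has outgoing connections to.
    Ties are resolved in favor of the country seen first."""
    votes = Counter(known[dst] for src, dst in edges if src == user and dst in known)
    if not votes:
        return None
    return votes.most_common(1)[0][0]


# Toy data: user "d" gives a humorous location, so the glossary lookup fails.
profiles = {
    "a": "Berlin, Germany",
    "b": "Moscow",
    "c": "St Petersburg",
    "d": "somewhere over the rainbow",
}
edges = [("d", "b"), ("d", "c"), ("d", "a")]  # (source, target) pairs

known = {}
for name, location in profiles.items():
    country = naive_country(location)
    if country is not None:
        known[name] = country

inferred = graph_country("d", edges, known)
print(known)
print(inferred)
```

In this toy example the glossary lookup labels three users, and the fourth is resolved only through the graph step, which mirrors the case the abstract emphasizes: users who state their location incorrectly or not at all.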
Pages: 16-29
Number of pages: 14
Related papers
50 in total
  • [21] GEO-SEQ2SEQ: Twitter User Geolocation on Noisy Data through Sequence to Sequence Learning
    Zhang, Jingyu
    DeLucia, Alexandra
    Zhang, Chenyu
    Dredze, Mark
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 4778 - 4794
  • [22] User Mood Tracking for Opinion Analysis on Twitter
    Castellucci, Giuseppe
    Croce, Danilo
    De Cao, Diego
    Basili, Roberto
    AI*IA 2016: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2016, 10037 : 76 - 88
  • [24] Automation on Twitter: Measuring the Effectiveness of Approaches to Bot Detection
    Beatson, Oliver
    Gibson, Rachel
    Cunill, Marta Cantijoch
    Elliot, Mark
    SOCIAL SCIENCE COMPUTER REVIEW, 2023, 41 (01) : 181 - 200
  • [25] Where in the World Are You? Geolocation and Language Identification in Twitter
    Graham, Mark
    Hale, Scott A.
    Gaffney, Devin
    PROFESSIONAL GEOGRAPHER, 2014, 66 (04) : 568 - 578
  • [26] Semantics-enabled User Interest Detection from Twitter
    Zarrinkalam, Fattane
    Fani, Hossein
    Bagheri, Ebrahim
    Kahani, Mohsen
    Du, Weichang
    2015 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT), VOL 1, 2015, : 469 - 476
  • [27] Detection of threatening user accounts on Twitter social media database
    Kumari, Asha
    Balkishan
    INTERNATIONAL JOURNAL OF INTELLIGENT ENGINEERING INFORMATICS, 2019, 7 (05) : 457 - 489
  • [28] Influential User Detection on Twitter: Analyzing Effect of Focus Rate
    Alp, Zeynep Zengin
    Oguducu, Sule Gunduz
    PROCEEDINGS OF THE 2016 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING ASONAM 2016, 2016, : 1321 - 1328
  • [29] A software architecture for Twitter collection, search and geolocation services
    Oussalah, M.
    Bhat, F.
    Challis, K.
    Schnier, T.
    KNOWLEDGE-BASED SYSTEMS, 2013, 37 : 105 - 120
  • [30] An overview of microblog user geolocation methods
    Luo, Xiangyang
    Qiao, Yaqiong
    Li, Chenliang
    Ma, Jiangtao
    Liu, Yimin
    INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (06)