Domain-based Latent Personal Analysis and its use for impersonation detection in social media

Authors
Osnat Mokryn
Hagit Ben-Shoshan
Affiliations
[1] University of Haifa, Information Systems
[2] University of Haifa, Management
Keywords
Latent Personal Analysis (LPA); Zipf; Authorship attribution; Impersonation; Sockpuppets; Front-users;
Abstract
Zipf’s law describes an inverse proportion between a word’s rank in a given corpus and its frequency in that corpus, roughly dividing the vocabulary into frequent and infrequent words. Here, we posit that, within a domain, an author’s signature can be derived from, in loose terms, the popular words the author omits and the infrequent words the author uses often. We devise a method, termed Latent Personal Analysis (LPA), for finding domain-based attributes of entities in a domain: their distance from the domain and their signature, which captures how they differ most from the domain. We identify the most suitable distance metric for the method among several candidates and construct the distances and personal signatures for authors, the domain’s entities. The signature consists of both overused terms (compared to the average) and missing popular terms. We validate the correctness and power of the signatures in identifying users and set existence conditions. We test LPA in several domains, both textual and non-textual. We then demonstrate the use of the method in explainable authorship attribution: we define algorithms that use LPA to identify two types of impersonation in social media: (1) authors with sockpuppet (multiple) accounts and (2) front-user accounts, operated by several authors. We validate the algorithms and apply them to a large-scale dataset obtained from a social media site with over 4000 users. We corroborate these results using temporal rate analysis. LPA can further be used to derive personal attributes in a wide range of scientific domains in which the constituents have a long-tail distribution of elements.
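The idea of a domain distance plus a signed signature can be illustrated with a minimal Python sketch. This is not the authors' implementation. The abstract does not name the distance metric that was selected, so the sketch assumes Jensen-Shannon divergence as a stand-in, and it assumes the domain distribution is built by pooling all authors' tokens. The function names (`term_distribution`, `signed_contributions`, `lpa_sketch`) and the toy data are hypothetical.

```python
import math
from collections import Counter


def term_distribution(tokens):
    """Normalize raw term counts into a probability distribution."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return {term: n / total for term, n in counts.items()}


def signed_contributions(author, domain):
    """Per-term share of the Jensen-Shannon divergence (an assumed metric).

    The sign marks the direction of the difference: positive when the
    author over-uses the term relative to the domain, negative when the
    term is under-used or missing.
    """
    contributions = {}
    for term in set(author) | set(domain):
        p = author.get(term, 0.0)
        q = domain.get(term, 0.0)
        m = (p + q) / 2
        part = 0.0
        if p > 0:
            part += 0.5 * p * math.log2(p / m)
        if q > 0:
            part += 0.5 * q * math.log2(q / m)
        contributions[term] = part if p >= q else -part
    return contributions


def lpa_sketch(author_tokens, domain, k=2):
    """Return (distance from the domain, top-k signed signature terms)."""
    contributions = signed_contributions(term_distribution(author_tokens), domain)
    distance = sum(abs(c) for c in contributions.values())
    signature = sorted(contributions.items(), key=lambda kv: -abs(kv[1]))[:k]
    return distance, signature


# Toy corpus (hypothetical): three authors, tokens already extracted.
docs = {
    "alice": ["the", "the", "of", "of", "quark", "quark", "quark", "quark"],
    "bob": ["the", "the", "the", "of", "of", "and", "and", "cat"],
    "carol": ["the", "the", "the", "of", "and", "and", "and", "dog"],
}
# Assumption: the domain is the pooled term distribution over all authors.
pooled_tokens = [tok for tokens in docs.values() for tok in tokens]
domain = term_distribution(pooled_tokens)

alice_distance, alice_signature = lpa_sketch(docs["alice"], domain)
print(alice_distance, alice_signature)
```

In this toy corpus, the signature for `alice` contains one missing popular term ("and", negative sign) and one overused term ("quark", positive sign), which mirrors the two kinds of signature elements the abstract describes. With base-2 logarithms the assumed distance is bounded between 0 and 1.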
Pages: 785-828
Number of pages: 43
Published in: USER MODELING AND USER-ADAPTED INTERACTION, 2021, 31 (04): 785-828