Domain-based Latent Personal Analysis and its use for impersonation detection in social media

被引:0
|
作者
Osnat Mokryn
Hagit Ben-Shoshan
机构
[1] University of Haifa,Information Systems
[2] University of Haifa,Management
关键词
Latent Personal Analysis (LPA); Zipf; Authorship attribution; Impersonation; Sockpuppets; Front-users;
D O I
暂无
中图分类号
学科分类号
摘要
Zipf’s law defines an inverse proportion between a word’s ranking in a given corpus and its frequency in it, roughly dividing the vocabulary into frequent words and infrequent ones. Here, we stipulate that within a domain an author’s signature can be derived from, in loose terms, the author’s missing popular words and frequently used infrequent words. We devise a method, termed Latent Personal Analysis (LPA), for finding domain-based attributes for entities in a domain: their distance from the domain and their signature, which determines how they most differ from a domain. We identify the most suitable distance metric for the method among several and construct the distances and personal signatures for authors, the domain’s entities. The signature consists of both over-used terms (compared to the average) and missing popular terms. We validate the correctness and power of the signatures in identifying users and set existence conditions. We test LPA in several domains, both textual and non-textual. We then demonstrate the use of the method in explainable authorship attribution: we define algorithms that utilize LPA  to identify two types of impersonation in social media: (1) authors with sockpuppets (multiple) accounts and (2) front-users accounts, operated by several authors. We validate the algorithms and employ them over a large-scale dataset obtained from a social media site with over 4000 users. We corroborate these results using temporal rate analysis. LPA  can further be used to devise personal attributes in a wide range of scientific domains in which the constituents have a long-tail distribution of elements.
引用
收藏
页码:785 / 828
页数:43
相关论文
共 50 条
  • [31] Ensemble-based domain adaptation on social media posts for irony detection
    Saroj, Anita
    Pal, Sukomal
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (8) : 23249 - 23268
  • [32] Ensemble-based domain adaptation on social media posts for irony detection
    Anita Saroj
    Sukomal Pal
    Multimedia Tools and Applications, 2024, 83 : 23249 - 23268
  • [33] Novel DNA binding domain-based assays for detection of methylated and nonmethylated DNA
    Acevedo, Luis G.
    Sanz, Ana
    Jelinek, Mary Anne
    EPIGENOMICS, 2011, 3 (01) : 93 - 101
  • [34] SynFinder: A System for Domain-Based Detection of Synonyms Using WordNet and the Web of Data
    Lombardi, Matteo
    Marani, Alessandro
    ADVANCES IN ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, MICAI 2015, PT I, 2015, 9413 : 15 - 28
  • [35] On the use of domain-based material point methods for problems involving large distortion
    Wang, L.
    Coombs, W. M.
    Augarde, C. E.
    Cortis, M.
    Charlton, T. J.
    Brown, M. J.
    Knappett, J.
    Brennan, A.
    Davidson, C.
    Richards, D.
    Blake, A.
    COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2019, 355 : 1003 - 1025
  • [36] A transform domain-based anomaly detection approach to network-wide traffic
    Jiang, Dingde
    Xu, Zhengzheng
    Zhang, Peng
    Zhu, Ting
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2014, 40 : 292 - 306
  • [37] Sustainable and lightweight domain-based intrusion detection system for in-vehicle network
    Kristianto, Edy
    Lin, Po -Ching
    Hwang, Ren-Hung
    SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2024, 41
  • [38] Development of a new time domain-based algorithm for train detection and axle counting
    Allotta, B.
    D'Adamio, P.
    Meli, E.
    Pugi, L.
    VEHICLE SYSTEM DYNAMICS, 2015, 53 (12) : 1850 - 1875
  • [39] Developing Social Media Use Purposes Scale and Examining Based on Some Personal Variables
    Sisman Eren, Esra
    HACETTEPE UNIVERSITESI EGITIM FAKULTESI DERGISI-HACETTEPE UNIVERSITY JOURNAL OF EDUCATION, 2014, 29 (04): : 230 - 243
  • [40] Avoidance of Social Media Advertising: A Latent Profile Analysis
    Mattke, Jens
    Mueller, Lea
    Maier, Christian
    Graser, Heinrich
    SIGMIS-CPR'18: PROCEEDINGS OF THE 2018 ACM SIGMIS CONFERENCE ON COMPUTERS AND PEOPLE RESEARCH, 2018, : 50 - 57