Domain-based Latent Personal Analysis and its use for impersonation detection in social media

Authors
Osnat Mokryn
Hagit Ben-Shoshan
Affiliations
[1] University of Haifa, Information Systems
[2] University of Haifa, Management
Keywords
Latent Personal Analysis (LPA); Zipf; Authorship attribution; Impersonation; Sockpuppets; Front-users;
Abstract
Zipf’s law describes an inverse proportion between a word’s rank in a given corpus and its frequency in that corpus, roughly dividing the vocabulary into frequent and infrequent words. Here, we posit that within a domain an author’s signature can be derived from, in loose terms, the popular words the author omits and the infrequent words the author uses often. We devise a method, termed Latent Personal Analysis (LPA), for finding domain-based attributes for entities in a domain: their distance from the domain and their signature, which captures how they differ most from the domain. We identify the most suitable distance metric for the method among several candidates and construct the distances and personal signatures for authors, who are the domain’s entities. The signature consists of both over-used terms (compared to the average) and missing popular terms. We validate the correctness and power of the signatures in identifying users and set existence conditions. We test LPA in several domains, both textual and non-textual. We then demonstrate the use of the method in explainable authorship attribution: we define algorithms that use LPA to identify two types of impersonation in social media: (1) authors with sockpuppet (multiple) accounts and (2) front-user accounts, each operated by several authors. We validate the algorithms and apply them to a large-scale dataset obtained from a social media site with over 4000 users. We corroborate these results using temporal rate analysis. LPA can further be used to derive personal attributes in a wide range of scientific domains in which the constituents have a long-tail distribution of elements.
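The idea of a distance from the domain plus a signature of over-used and missing terms can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not name the chosen distance metric, so the sketch assumes a symmetrized Kullback-Leibler divergence with a small smoothing constant, and it assumes the domain is the plain average of per-author term distributions. The function names (`term_distribution`, `domain_distribution`, `signature`) and the `epsilon` and `top_k` parameters are hypothetical.

```python
from collections import Counter
import math


def term_distribution(tokens):
    """Normalized term frequencies for one author's tokens."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return {term: count / total for term, count in counts.items()}


def domain_distribution(tokens_by_author):
    """Assumed domain model: the average of all per-author distributions."""
    dists = [term_distribution(toks) for toks in tokens_by_author.values()]
    vocab = set().union(*dists)
    return {t: sum(d.get(t, 0.0) for d in dists) / len(dists) for t in vocab}


def signature(author, domain, epsilon=1e-6, top_k=3):
    """Return (distance, signature) of an author relative to the domain.

    Assumption: distance is the symmetrized KL divergence,
    sum over terms of (p - q) * log(p / q), with epsilon smoothing so
    that terms absent on one side still contribute a finite amount.
    """
    vocab = set(author) | set(domain)
    p = {t: author.get(t, 0.0) + epsilon for t in vocab}
    q = {t: domain.get(t, 0.0) + epsilon for t in vocab}
    p_total, q_total = sum(p.values()), sum(q.values())

    contribution, direction = {}, {}
    for t in vocab:
        pt, qt = p[t] / p_total, q[t] / q_total
        contribution[t] = (pt - qt) * math.log(pt / qt)  # always >= 0
        direction[t] = "over-used" if pt > qt else "missing or under-used"

    distance = sum(contribution.values())
    # The terms that contribute most to the distance form the signature.
    ranked = sorted(vocab, key=lambda t: (-contribution[t], t))[:top_k]
    return distance, [(t, direction[t], contribution[t]) for t in ranked]


# Toy domain: author "c" never uses the popular words and over-uses "zebra".
corpus = {
    "a": ["the", "of", "the", "of", "cat"],
    "b": ["the", "of", "the", "of", "dog"],
    "c": ["zebra", "zebra", "zebra", "cat", "dog"],
}
domain = domain_distribution(corpus)
distance_c, signature_c = signature(term_distribution(corpus["c"]), domain)
for term, kind, weight in signature_c:
    print(term, kind, round(weight, 3))
```

In this toy domain the signature of author "c" contains both kinds of terms the abstract mentions: the popular words "the" and "of" appear as missing, and "zebra" appears as over-used. Because each term's contribution is reported alongside the total distance, the result stays explainable at the level of individual terms.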
Pages: 785 - 828
Number of pages: 43