Learning Joint Topic Representation for Detecting Drift in Social Media Text

被引:0
|
作者
Vijayarani, J. [1 ]
Geetha, T. V. [2 ]
机构
[1] Deemed Univ, Hindustan Inst Technol & Sci, Chennai, India
[2] SSN Coll Engn, Chennai, India
关键词
Topic drift; hashtag; geotag; Langevin dynamics; word embedding; topic2vec; MODEL;
D O I
10.1142/S0218488524500247
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social media texts like tweets and blogs are collaboratively created by human interaction. Rapidly changing trends are leading to topic drift in the social media text. This drift is usually associated with words and hashtags. However, geotags play an essential part in determining topic distribution with location context. The rate of change in the distribution of words, hashtags and geotags cannot be considered uniform and must be handled accordingly. This paper builds a topic model that associates the topic with a mixture of distributions of words, hashtags and geotags. Stochastic gradient Langevin dynamic model with varying mini-batch sizes is used to capture the changes due to the asynchronous distribution of words and tags. Topic representations with co-occurrence and location contexts are specified as hashtag context vector and geotag context vector respectively. These two vectors are jointly learned to yield topical word embedding vectors over time conditioned on hashtags and geotags that can predict location-based topical variations effectively. When evaluated with Chennai and UK geolocated Twitter data, the proposed joint topical word embedding model enhanced by the social tags context, outperforms other methods.
引用
收藏
页码:955 / 983
页数:29
相关论文
共 50 条
  • [1] Latent Topic Text Representation Learning on Statistical Manifolds
    Jiang, Bingbing
    Li, Zhengyu
    Chen, Huanhuan
    Cohn, Anthony G.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (11) : 5643 - 5654
  • [2] Topic Modeling over Text Streams from Social Media
    Smatana, Miroslav
    Paralic, Jan
    Butka, Peter
    TEXT, SPEECH, AND DIALOGUE, 2016, 9924 : 163 - 172
  • [3] Topic text detection by clustering algorithm for social network media
    Sha S.
    International Journal of Networking and Virtual Organisations, 2024, 30 (03) : 246 - 256
  • [4] A Dataset for Detecting Humor in Telugu Social Media Text
    Bellamkonda, Sriphani Vardhan
    Lohakare, Maithili
    Patel, Shaswat P.
    PROCEEDINGS OF THE SECOND WORKSHOP ON SPEECH AND LANGUAGE TECHNOLOGIES FOR DRAVIDIAN LANGUAGES (DRAVIDIANLANGTECH 2022), 2022, : 9 - 14
  • [5] Detecting clickbaits automatically for analyzing social media text
    Lakshmidheeraj
    Kumaran, U.
    Devi, T.
    Test Engineering and Management, 2019, 81 (11-12): : 5449 - 5453
  • [6] Study on text representation method based on deep learning and topic information
    Jiang, Zilong
    Gao, Shu
    Chen, Liangchen
    COMPUTING, 2020, 102 (03) : 623 - 642
  • [7] Study on text representation method based on deep learning and topic information
    Zilong Jiang
    Shu Gao
    Liangchen Chen
    Computing, 2020, 102 : 623 - 642
  • [8] Joint Emotion-Topic Modeling for Social Affective Text Mining
    Bao, Shenghua
    Xu, Shengliang
    Zhang, Li
    Yan, Rong
    Su, Zhong
    Han, Dingyi
    Yu, Yong
    2009 9TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2009, : 699 - +
  • [9] A joint-training topic model for social media texts
    Qin, Simeng
    Zhang, Mingli
    Hu, Haiju
    Li, Gang
    HUMANITIES & SOCIAL SCIENCES COMMUNICATIONS, 2025, 12 (01):
  • [10] TOPIC STRUCTURE REPRESENTATION AND TEXT RECALL
    LORCH, RF
    LORCH, EP
    JOURNAL OF EDUCATIONAL PSYCHOLOGY, 1985, 77 (02) : 137 - 148