Learning Joint Topic Representation for Detecting Drift in Social Media Text

被引:0
|
作者
Vijayarani, J. [1 ]
Geetha, T. V. [2 ]
机构
[1] Deemed Univ, Hindustan Inst Technol & Sci, Chennai, India
[2] SSN Coll Engn, Chennai, India
关键词
Topic drift; hashtag; geotag; Langevin dynamics; word embedding; topic2vec; MODEL;
D O I
10.1142/S0218488524500247
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social media texts like tweets and blogs are collaboratively created by human interaction. Rapidly changing trends are leading to topic drift in the social media text. This drift is usually associated with words and hashtags. However, geotags play an essential part in determining topic distribution with location context. The rate of change in the distribution of words, hashtags and geotags cannot be considered uniform and must be handled accordingly. This paper builds a topic model that associates the topic with a mixture of distributions of words, hashtags and geotags. Stochastic gradient Langevin dynamic model with varying mini-batch sizes is used to capture the changes due to the asynchronous distribution of words and tags. Topic representations with co-occurrence and location contexts are specified as hashtag context vector and geotag context vector respectively. These two vectors are jointly learned to yield topical word embedding vectors over time conditioned on hashtags and geotags that can predict location-based topical variations effectively. When evaluated with Chennai and UK geolocated Twitter data, the proposed joint topical word embedding model enhanced by the social tags context, outperforms other methods.
引用
收藏
页码:955 / 983
页数:29
相关论文
共 50 条
  • [31] JKRL: Joint Knowledge Representation Learning of Text Description and Knowledge Graph
    Xu, Guoyan
    Zhang, Qirui
    Yu, Du
    Lu, Sijun
    Lu, Yuwei
    SYMMETRY-BASEL, 2023, 15 (05):
  • [32] Joint representation learning for text and 3D point cloud
    Huang, Rui
    Pan, Xuran
    Zheng, Henry
    Jiang, Haojun
    Xie, Zhifeng
    Wu, Cheng
    Song, Shiji
    Huang, Gao
    PATTERN RECOGNITION, 2024, 147
  • [33] An empirical evaluation of text representation schemes to filter the social media stream
    Modha, Sandip
    Majumder, Prasenjit
    Mandl, Thomas
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2022, 34 (03) : 499 - 525
  • [34] Hybrid Text Representation for Explainable Suicide Risk Identification on Social Media
    Naseem, Usman
    Khushi, Matloob
    Kim, Jinman
    Dunn, Adam G.
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (04) : 4663 - 4672
  • [35] Learning Discriminative Text Representation for Streaming Social Event Detection
    Tong, Chaodong
    Peng, Huailiang
    Bai, Xu
    Dai, Qiong
    Zhang, Ruitong
    Li, Yangyang
    Xu, Hanjie
    Gu, Xian-Ming
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (12) : 12295 - 12309
  • [36] JointGT: Graph-Text Joint Representation Learning for Text Generation from Knowledge Graphs
    Ke, Pei
    Ji, Haozhe
    Ran, Yu
    Cui, Xin
    Wang, Liwei
    Song, Linfeng
    Zhu, Xiaoyan
    Huang, Minlie
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 2526 - 2538
  • [37] Joint Sentiment/Topic Extraction from Text
    Sowmiya, J. S.
    Chandrakala, S.
    2014 INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION CONTROL AND COMPUTING TECHNOLOGIES (ICACCCT), 2014, : 611 - 615
  • [38] Topic-aware joint analysis of overlapping communities and roles in social media
    Costa, Gianni
    Ortale, Riccardo
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2020, 9 (04) : 415 - 429
  • [39] Topic-aware joint analysis of overlapping communities and roles in social media
    Gianni Costa
    Riccardo Ortale
    International Journal of Data Science and Analytics, 2020, 9 : 415 - 429
  • [40] Measuring and Detecting Virality on Social Media: The Case of Twiter's Viral Tweets Topic
    Elmas, Tugrulcan
    Selim, Stephane
    Houssiaux, Celia
    COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023, 2023, : 314 - 317