Learning Joint Topic Representation for Detecting Drift in Social Media Text

被引:0
|
作者
Vijayarani, J. [1 ]
Geetha, T. V. [2 ]
机构
[1] Deemed Univ, Hindustan Inst Technol & Sci, Chennai, India
[2] SSN Coll Engn, Chennai, India
关键词
Topic drift; hashtag; geotag; Langevin dynamics; word embedding; topic2vec; MODEL;
D O I
10.1142/S0218488524500247
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social media texts like tweets and blogs are collaboratively created by human interaction. Rapidly changing trends are leading to topic drift in the social media text. This drift is usually associated with words and hashtags. However, geotags play an essential part in determining topic distribution with location context. The rate of change in the distribution of words, hashtags and geotags cannot be considered uniform and must be handled accordingly. This paper builds a topic model that associates the topic with a mixture of distributions of words, hashtags and geotags. Stochastic gradient Langevin dynamic model with varying mini-batch sizes is used to capture the changes due to the asynchronous distribution of words and tags. Topic representations with co-occurrence and location contexts are specified as hashtag context vector and geotag context vector respectively. These two vectors are jointly learned to yield topical word embedding vectors over time conditioned on hashtags and geotags that can predict location-based topical variations effectively. When evaluated with Chennai and UK geolocated Twitter data, the proposed joint topical word embedding model enhanced by the social tags context, outperforms other methods.
引用
收藏
页码:955 / 983
页数:29
相关论文
共 50 条
  • [21] A systematic review of the use of topic models for short text social media analysis
    Caitlin Doogan Poet Laureate
    Wray Buntine
    Henry Linger
    Artificial Intelligence Review, 2023, 56 : 14223 - 14255
  • [22] A systematic review of the use of topic models for short text social media analysis
    Laureate, Caitlin Doogan Poet
    Buntine, Wray
    Linger, Henry
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (12) : 14223 - 14255
  • [23] Autoencoder as Assistant Supervisor: Improving Text Representation for Chinese Social Media Text Summarization
    Ma, Shuming
    Sun, Xu
    Lin, Junyang
    Wang, Houfeng
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2018, : 725 - 731
  • [24] Discriminative Topic Sparse Representation for Text Categorization
    Zheng, Wenbin
    Liu, Yanqiu
    Lu, Huijuan
    Tang, Hong
    2017 10TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL. 1, 2017, : 454 - 457
  • [25] Dataless Text Classification with Pseudo Topic Representation
    Yan, Rong
    Chen, Qi
    Gao, Guanglai
    2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 1255 - 1259
  • [26] Topic interest, text representation, and quality of experience
    Schiefele, U
    CONTEMPORARY EDUCATIONAL PSYCHOLOGY, 1996, 21 (01) : 3 - 18
  • [27] Detecting Propaganda Techniques in Code-Switched Social Media Text
    Salman, Muhammad Umar
    Hanif, Asif
    Shehata, Shady
    Nakov, Preslav
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 16794 - 16812
  • [28] Detecting suicidality on social media: Machine learning at rescue
    Rabani, Syed Tanzeel
    Khanday, Akib Mohi Ud Din
    Khan, Qamar Rayees
    Hajam, Umar Ayoub
    Imran, Ali Shariq
    Kastrati, Zenun
    EGYPTIAN INFORMATICS JOURNAL, 2023, 24 (02) : 291 - 302
  • [29] Leveraging transfer learning for detecting misinformation on social media
    Reshi J.A.
    Ali R.
    International Journal of Information Technology, 2024, 16 (2) : 949 - 955
  • [30] Short-text learning in social media: a review
    Tommasel, Antonela
    Godoy, Daniela
    KNOWLEDGE ENGINEERING REVIEW, 2019, 34 : 1 - 38