Sparse Topical Coding with Sparse Groups

被引:2
|
作者
Peng, Min [1 ]
Xie, Qianqian [1 ]
Huang, Jiajia [1 ]
Zhu, Jiahui [1 ]
Ouyang, Shuang [1 ]
Huang, Jimin [1 ]
Tian, Gang [1 ]
机构
[1] Wuhan Univ, Sch Comp, Wuhan, Peoples R China
来源
WEB-AGE INFORMATION MANAGEMENT, PT I | 2016年 / 9658卷
关键词
Document representation; Topic model; Sparse coding; Sparse group lasso; REGRESSION; SELECTION;
D O I
10.1007/978-3-319-39937-9_32
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning a latent semantic representing from a large number of short text corpora makes a profound practical significance in research and engineering. However, it is difficult to use standard topic models in microblogging environments since microblogs have short length, large amount, snarled noise and irregular modality characters, which prevent topic models from using full information of microblogs. In this paper, we propose a novel non-probabilistic topic model called sparse topical coding with sparse groups (STCSG), which is capable of discovering sparse latent semantic representations of large short text corpora. STCSG relaxes the normalization constraint of the inferred representations with sparse group lasso, a sparsity-inducing regularizer, which is convenient to directly control the sparsity of document, topic and word codes. Furthermore, the relaxed non-probabilistic STCSG can be effectively learned with alternating direction method of multipliers (ADMM). Our experimental results on Twitter dataset demonstrate that STCSG performs well in finding meaningful latent representations of short documents. Therefore, it can substantially improve the accuracy and efficiency of document classification.
引用
收藏
页码:415 / 426
页数:12
相关论文
共 50 条
  • [1] Neural Sparse Topical Coding
    Peng, Min
    Xie, Qianqian
    Zhang, Yanchun
    Wang, Hua
    Zhang, Xiuzheng
    Huang, Jimin
    Tian, Gang
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 2332 - 2340
  • [2] Bayesian Sparse Topical Coding
    Peng, Min
    Xie, Qianqian
    Wang, Hua
    Zhang, Yanchun
    Tian, Gang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (06) : 1080 - 1093
  • [3] Block Bayesian Sparse Topical Coding
    Peng, Min
    Shi, Hongliang
    Xie, Qianqian
    Zhang, Yihan
    Wang, Hua
    Li, Zhaoyunfei
    Yong, Jianming
    PROCEEDINGS OF THE 2018 IEEE 22ND INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN ((CSCWD)), 2018, : 271 - 276
  • [4] Clustering Improvement via Integrating with Sparse Topical Coding
    Ahmadi, Parvin
    Kaviani, Razie
    Gholampour, Iman
    Tabandeh, Mahmoud
    2015 23RD IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2015, : 466 - 471
  • [5] Dynamic scene understanding by improved sparse topical coding
    Fu, Wei
    Wang, Jinqiao
    Lu, Hanqing
    Ma, Songde
    PATTERN RECOGNITION, 2013, 46 (07) : 1841 - 1850
  • [6] Sparse coding in sparse winner networks
    Starzyk, Janusz A.
    Liu, Yinyin
    Vogel, David
    ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 2, PROCEEDINGS, 2007, 4492 : 534 - +
  • [7] Sparse Multi-Modal Topical Coding for Image Annotation
    Song, Lingyun
    Luo, Minnan
    Liu, Jun
    Zhang, Lingling
    Qian, Buyue
    Li, Max Haifei
    Zheng, Qinghua
    NEUROCOMPUTING, 2016, 214 : 162 - 174
  • [8] Sparse Relational Topical Coding on multi-modal data
    Song, Lingyun
    Liu, Jun
    Luo, Minnan
    Qian, Buyue
    Yang, Kuan
    PATTERN RECOGNITION, 2017, 72 : 368 - 380
  • [9] Discovery of the Topical Object in Commercial Video: A Sparse Coding Method
    Liu, Yunhui
    Liu, Huaping
    Sun, Fuchun
    PATTERN RECOGNITION (CCPR 2014), PT II, 2014, 484 : 245 - 254
  • [10] Laplacian Sparse Coding, Hypergraph Laplacian Sparse Coding, and Applications
    Gao, Shenghua
    Tsang, Ivor Wai-Hung
    Chia, Liang-Tien
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (01) : 92 - 104