Exploiting Unlabeled Data for Question Classification

被引:0
|
作者
Tomas, David [1 ]
Giuliano, Claudio [2 ]
机构
[1] Univ Alicante, Dept Software & Comp Syst, Alicante, Spain
[2] FBK Irst, Human Language Technol Grp, Trento, Italy
关键词
question classification; semi-supervised learning; kernel methods;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we introduce a kernel-based approach to question classification. We employed a kernel function based on latent semantic information acquired from Wikipedia. This kernel allows including external semantic knowledge into the supervised learning process. We obtained a highly effective question classifier combining this knowledge with a bag-of-words approach by means of composite kernels. As the semantic information is acquired from unlabeled text, our system can be easily adapted to different languages and domains. We tested it on a parallel corpus of English and Spanish questions.
引用
收藏
页码:137 / 144
页数:8
相关论文
共 50 条
  • [31] Boosting Semi-Supervised Learning by Exploiting All Unlabeled Data
    Chen, Yuhao
    Tan, Xin
    Zhao, Borui
    Chen, Zhaowei
    Song, Renjie
    Liang, Jiajun
    Lu, Xuequan
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 7548 - 7557
  • [32] Tri-training: Exploiting unlabeled data using three classifiers
    Zhou, ZH
    Li, M
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (11) : 1529 - 1541
  • [33] Active learning for deep object detection by fully exploiting unlabeled data
    Tan, Feixiang
    Zheng, Guansheng
    CONNECTION SCIENCE, 2023, 35 (01)
  • [34] Exploiting Unlabeled Data in CNNs by Self-Supervised Learning to Rank
    Liu, Xialei
    van de Weijer, Joost
    Bagdanov, Andrew D.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (08) : 1862 - 1878
  • [35] Biological sequence classification utilizing positive and unlabeled data
    Xiao, Yuanyuan
    Segal, Mark R.
    BIOINFORMATICS, 2008, 24 (09) : 1198 - 1205
  • [36] Incorporating large unlabeled data to enhance EM classification
    Wu, Xintao
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2006, 26 (03) : 211 - 226
  • [37] Unlabeled data selection for active learning in image classification
    Li, Xiongquan
    Wang, Xukang
    Chen, Xuhesheng
    Lu, Yao
    Fu, Hongpeng
    Wu, Ying Cheng
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [38] CLASSIFICATION FROM ONLY POSITIVE AND UNLABELED FUNCTIONAL DATA
    Terada, Yoshikazu
    Ogasawara, Issei
    Nakata, Ken
    ANNALS OF APPLIED STATISTICS, 2020, 14 (04): : 1724 - 1742
  • [39] Take full advantage of unlabeled data for sentiment classification
    La, Lei
    Cao, Shuyan
    Qin, Liangjuan
    KYBERNETES, 2018, 47 (03) : 474 - 486
  • [40] An Instance Level Analysis of Classification Difficulty for Unlabeled Data
    Ueda, Patricia S. M.
    Rivolli, Adriano
    Lorena, Ana Carolina
    INTELLIGENT SYSTEMS, BRACIS 2024, PT I, 2025, 15412 : 141 - 155