A comparative study on supervised and unsupervised learning approaches for multilingual text categorization

被引:0
|
作者
Lee, Chung-Hong [1 ]
Yang, Hsin-Chang [2 ]
Chen, Ting-Chung [1 ]
Ma, Sheng-Min [1 ]
机构
[1] Natl Kaohsiung Univ Appl Sci, Dept Elect Engn, Kaohsiung, Taiwan
[2] Chang Jung Univ, Dept Informat Management, Tainan, Taiwan
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently users of internationally distributed information networks need tools and methods that will enable them to discover, retrieve and categorize relevant information, in whatever language and form it may have been stored. This drives a convergence of numerous interests from diverse research communities focusing on the issues related to multilingual text categorization. In this work we compare and evaluate the performance of the leading supervised and unsupervised approaches for multilingual text categorization by using various performance measures and standard document corpora. For simplicity, we selected Support Vector Machines (SVM) and Latent Semantic Indexing (LSI) techniques as representatives of supervised and unsupervised methods for multilingual text categorization, respectively. The preliminary results show that our platform models including both supervised and unsupervised learning methods have the potentials for multilingual text categorization.
引用
收藏
页码:511 / +
页数:2
相关论文
共 50 条
  • [41] Machine learning based approaches to text categorization with resampling methods
    Jo, T
    8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL II, PROCEEDINGS: COMPUTING TECHNIQUES, 2004, : 93 - 98
  • [42] Supervised and Unsupervised Deep Learning Approaches for EEG Seizure Prediction
    Georgis-Yap, Zakary
    Popovic, Milos R.
    Khan, Shehroz S.
    JOURNAL OF HEALTHCARE INFORMATICS RESEARCH, 2024, 8 (02) : 286 - 312
  • [43] Supervised and Unsupervised Deep Learning Approaches for EEG Seizure Prediction
    Zakary Georgis-Yap
    Milos R. Popovic
    Shehroz S. Khan
    Journal of Healthcare Informatics Research, 2024, 8 : 286 - 312
  • [44] An application of supervised and unsupervised learning approaches to telecommunications fraud detection
    Hilas, Constantinos S.
    Mastorocostas, Paris As.
    KNOWLEDGE-BASED SYSTEMS, 2008, 21 (07) : 721 - 726
  • [45] Supervised and Unsupervised Machine Learning Approaches for Bridge Damage Prediction
    Tamura, S.
    Zhang, B.
    Wang, Y.
    Chen, F.
    Nguyen, K.
    STRUCTURAL HEALTH MONITORING 2013, VOLS 1 AND 2, 2013, : 182 - 189
  • [46] Learning to Speak from Text: Zero-Shot Multilingual Text-to-Speech with Unsupervised Text Pretraining
    Saeki, Takaaki
    Maiti, Soumi
    Li, Xinjian
    Watanabe, Shinji
    Takamichi, Shinnosuke
    Saruwatari, Hiroshi
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 5179 - 5187
  • [47] Adaptive categorization in unsupervised learning
    Clapper, JP
    Bower, GH
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-LEARNING MEMORY AND COGNITION, 2002, 28 (05) : 908 - 923
  • [48] Unsupervised categorization and category learning
    Pothos, EM
    Chater, N
    QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY SECTION A-HUMAN EXPERIMENTAL PSYCHOLOGY, 2005, 58 (04): : 733 - 752
  • [49] A neural network model for hierarchical multilingual text categorization
    Chau, RN
    Yeh, CS
    Smith, KA
    ADVANCES IN NEURAL NETWORKS - ISNN 2005, PT 2, PROCEEDINGS, 2005, 3497 : 238 - 245
  • [50] Text categorization of multilingual web pages in specific domain
    Liu, Jicheng
    Liang, Chunyan
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2008, 5012 : 938 - 944