Tibetan-Chinese Cross-language Topic Extraction and Alignment

被引:0
|
作者
Sun, Yuan [1 ,2 ]
Zhao, Qian [1 ,2 ]
Yuan, Wolerrg [1 ,2 ]
机构
[1] Minzu Univ China, Sch Informat Engn, Beijing 100081, Peoples R China
[2] Minzu Univ China, Natl Language Resource & Monitoring Res Ctr, Minor Languages Branch, Beijing 100081, Peoples R China
关键词
Tibetan-Chinese; topic extraction; topic alignment;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Getting information about Tibetan topics in other languages is the basis and the most critical module of cross-language topic detection and tracking. This paper firstly extracts Tibetan-Chinese topics using LDA model, and then proposes a voting method based on cosine distance, Euclidean distance, Hellinger distance and KL distance to realize Tibetan-Chinese cross-language topic alignment. Finally, the experimental results prove that the method is effective.
引用
收藏
页码:67 / 71
页数:5
相关论文
共 50 条
  • [1] Chinese-Thai Cross-Language Topic Extraction and Alignment
    Li, Xia
    Zeng, ZiHang
    Zhang, JianShu
    Jiang, ShengYi
    2017 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2017, : 239 - 242
  • [2] Research on Tibetan-Chinese Cross-language Information Retrieval System Base on E-commerce Platform
    Wan, Fucheng
    Zhu, Lin
    PROCEEDINGS OF THE 2017 2ND INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND INFORMATION ENGINEERING (ICACIE 2017), 2017, 119 : 145 - 149
  • [3] Tibetan-Chinese Cross Language Named Entity Extraction Based on Comparable Corpus and Naturally Annotated Resources
    Sun, Yuan
    Guo, Wenbin
    Zhao, Xiaobing
    2014 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING (CIDM), 2014, : 288 - 295
  • [4] Study on Tibetan-Chinese Comparable Corpus Extraction
    Sun, Yuan
    Guo, Li-li
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTER SCIENCE (AICS 2016), 2016, : 287 - 293
  • [5] A New Method for Word Alignment of Tibetan-Chinese Machine Translation
    Wan, Fucheng
    He, Xiangzhen
    Yu, Hongzhi
    ADVANCES IN TEXTILE ENGINEERING AND MATERIALS IV, 2014, 1048 : 521 - 525
  • [6] Learning Tibetan-Chinese cross-lingual word embeddings
    Ma, Wei
    Yu, Hongzhi
    Zhao, Kun
    Zhao, Deshun
    2019 15TH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRIDS (SKG 2019), 2019, : 49 - 53
  • [7] Unsupervised Deep Cross-Language Entity Alignment
    Jiang, Chuanyu
    Qian, Yiming
    Chen, Lijun
    Gu, Yang
    Xie, Xia
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT IV, 2023, 14172 : 3 - 19
  • [8] Tibetan Syntactic Parsing for Tibetan-Chinese Machine Translation
    Wan, Fu-cheng
    Yu, Hong-zhi
    Wu, Xi-hong
    He, Xiang-zhen
    INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND ENGINEERING (ACSE 2014), 2014, : 359 - 364
  • [9] Tibetan-Chinese Cross-Lingual Sentiment Classification Based on Adversarial Network
    Zhang, Tingting
    Jiang, Tao
    Shan, Ruikang
    2021 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2021, : 340 - 345
  • [10] Key Techniques of Cross-Language Medical Term Alignment
    Yang, Yuqi
    Zhang, Guangzhi
    Bie, Rongfang
    Kim, Sungjoong
    Shin, Dongil
    2016 INTERNATIONAL CONFERENCE ON IDENTIFICATION, INFORMATION AND KNOWLEDGE IN THE INTERNET OF THINGS (IIKI), 2016, : 279 - 286