Mining Summary of Short Text with Centroid Similarity Distance

被引:1
|
作者
Franciscus, Nigel [1 ]
Wang, Junhu [1 ]
Stantic, Bela [1 ]
机构
[1] Inst Integrated & Intelligent Syst, Brisbane, Qld, Australia
关键词
Text summarization; Short text; Word embeddings;
D O I
10.1007/978-3-030-35231-8_32
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text summarization aims at producing a concise summary that preserves key information. Many textual inputs are short and do not fit with the standard longer text-based techniques. Most of the existing short text summarization approaches rely on metadata information such as the authors or reply networks. However, not all raw textual data can provide such information. In this paper, we present our method to summarize short text using a centroid-based method with word embeddings. In particular, we consider the task when there is no metadata information other than the text itself. We show that the centroid embeddings approach can be applied to short text to capture semantically similar sentences for summarization. With further clustering strategy, we were able to identify relevant sub-topics that further improves the context diversity in the overall summary. The empirical evaluation demonstrates that our approach can outperform other methods on two annotated LREC track dataset.
引用
收藏
页码:447 / 461
页数:15
相关论文
共 50 条
  • [1] Few-shot short-text classification with language representations and centroid similarity
    Wenfu Liu
    Jianmin Pang
    Nan Li
    Feng Yue
    Guangming Liu
    Applied Intelligence, 2023, 53 : 8061 - 8072
  • [2] Few-shot short-text classification with language representations and centroid similarity
    Liu, Wenfu
    Pang, Jianmin
    Li, Nan
    Yue, Feng
    Liu, Guangming
    APPLIED INTELLIGENCE, 2023, 53 (07) : 8061 - 8072
  • [3] Research on Semantic Similarity of Short Text Based on Bert and Time Warping Distance
    Qiu, Shijie
    Niu, Yan
    Li, Jun
    Li, Xing
    JOURNAL OF WEB ENGINEERING, 2021, 20 (08): : 2521 - 2543
  • [4] Text mining: identification of similarity of text documents using hybrid similarity model
    K. M. Shiva Prasad
    Iran Journal of Computer Science, 2023, 6 (2) : 123 - 135
  • [5] TAXES ON MINING - SHORT SUMMARY
    BROWN, RD
    CIM BULLETIN, 1977, 70 (783): : 144 - 146
  • [6] An effective short text conceptualization based on new short text similarity
    Bekkali, Mohammed
    Lachkar, Abdelmonaime
    SOCIAL NETWORK ANALYSIS AND MINING, 2018, 9 (01)
  • [7] Similarity measures for short segments of text
    Metzler, Donald
    Dumais, Susan
    Meek, Christopher
    ADVANCES IN INFORMATION RETRIEVAL, 2007, 4425 : 16 - +
  • [8] Benchmarking short text semantic similarity
    O'Shea J.
    Bandar Z.
    Crockett K.
    McLean D.
    International Journal of Intelligent Information and Database Systems, 2010, 4 (02) : 103 - 120
  • [9] Combining Coupled Distance Discrimination and Strong Classifica- tion Features for Short Text Similarity Calculation
    Ma H.-F.
    Liu W.
    Li Z.-X.
    Lin X.-H.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2019, 47 (06): : 1331 - 1336
  • [10] Text summary evaluation based on interpretable semantic textual similarity
    Goutam Majumder
    Vikrant Rajput
    Partha Pakray
    Sivaji Bandyopadhyay
    Benoit Favre
    Multimedia Tools and Applications, 2024, 83 : 3233 - 3258