Learning to Rank Hypernyms of Financial Terms Using Semantic Textual Similarity

被引:0
|
作者
Ghosh S. [1 ,2 ]
Chopra A. [3 ]
Naskar S.K. [2 ]
机构
[1] Fidelity Investments, Karnataka, Bengaluru
[2] Jadavpur University, West Bengal, Kolkata
[3] Tredence Analytics, Karnataka, Bengaluru
关键词
Financial texts; Hypernym ranking; Natural language processing; Text similarity;
D O I
10.1007/s42979-023-02134-z
中图分类号
学科分类号
摘要
Over the years, with the advancement of digitalization, investors have started embracing the online mode of performing financial activities. Most investors prefer to read contents over the Internet before making decisions. The financial services’ industry has terms and concepts that are complex and difficult to understand. To fully comprehend these contents, one needs to have a thorough understanding of these terms. Getting a basic idea about a term becomes easy when it is explained with the help of the broad category to which it belongs. This broad category is referred to as hypernym. In this paper, we propose a system capable of extracting and ranking hypernyms for a given financial term. The system has been trained with financial text corpora obtained from various sources. Embeddings of financial terms have been extracted using domain-specific embeddings and fine-tuned using SentenceBERT as reported by Reimers (in: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), Association for Computational Linguistics, Hong Kong, 2019). A novel approach has been used to augment the training set with negative samples. Finally, we benchmark the system performance with that of the existing ones. We establish that it performs better than the existing ones and is also scalable. © 2023, The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd.
引用
收藏
相关论文
共 50 条
  • [1] Learning Semantic Textual Similarity from Conversations
    Yang, Yinfei
    Yuan, Steve
    Cer, Daniel
    Kong, Sheng-yi
    Constant, Noah
    Pilar, Petr
    Ge, Heming
    Sung, Yun-Hsuan
    Strope, Brian
    Kurzweil, Ray
    REPRESENTATION LEARNING FOR NLP, 2018, : 164 - 174
  • [2] Efficient Textual Similarity using Semantic MinHashing
    Nawaz, Waqas
    Baig, Maryam
    Khan, Kifayat Ullah
    2024 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING, IEEE BIGCOMP 2024, 2024, : 262 - 269
  • [3] Semantic Textual Similarity Using Various Approaches
    Kazula, Maciej
    Kozlowski, Marek
    MACHINE INTELLIGENCE AND BIG DATA IN INDUSTRY, 2016, 19 : 49 - 62
  • [4] Linking Datasets Using Semantic Textual Similarity
    McCrae, John P.
    Buitelaar, Paul
    CYBERNETICS AND INFORMATION TECHNOLOGIES, 2018, 18 (01) : 109 - 123
  • [5] Predicting Semantic Textual Similarity of Arabic Question Pairs using Deep Learning
    Einea, Omar
    Elnagar, Ashraf
    2019 IEEE/ACS 16TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA 2019), 2019,
  • [6] Semantic textual similarity for modern standard and dialectal Arabic using transfer learning
    Sulaiman, Mansour Al
    Moussa, Abdullah M.
    Abdou, Sherif
    Elgibreen, Hebah
    Faisal, Mohammed
    Rashwan, Mohsen
    PLOS ONE, 2022, 17 (08):
  • [7] Spectral Learning of Semantic Units in a Sentence Pair to Evaluate Semantic Textual Similarity
    Mehndiratta, Akanksha
    Asawa, Krishna
    8TH INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS, BDA 2020, 2020, 12581 : 49 - 59
  • [8] Question Similarity Detection in Turkish Using Semantic Textual Similarity Methods
    Yildiz, Eray
    Findik, Yasin
    2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
  • [9] Deep learning based Bengali question answering system using semantic textual similarity
    Das, Arijit
    Saha, Diganta
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (01) : 589 - 613
  • [10] Deep learning based Bengali question answering system using semantic textual similarity
    Arijit Das
    Diganta Saha
    Multimedia Tools and Applications, 2022, 81 : 589 - 613