Using Semantic Similarity for Identifying Relevant Page Numbers for Indexed Term of Textual Book

被引:0
|
作者
Siahaan, Daniel [1 ]
Christina, Sherly [2 ]
机构
[1] Inst Teknol Sepuluh Nopember, Dept Informat, Surabaya, Indonesia
[2] Univ Palangkaraya, Dept Informat, Palangkaraya, Indonesia
关键词
book indexing; back-of-book index; relevant page number; semantic relation; AGREEMENT;
D O I
10.1007/978-3-662-46742-8_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Back-of-book index page is one of navigation tools for reader. It helps reader to immediately jump to a page that contains relevant information regarding a specific term. It helps reader to retrieve information about specific topics in mind without having to read the complete book. Indexed terms are usually determined by author based on one's subjective preference on what indications should be used to decide whether a term should be indexed and what pages are relevant. Therefore, indexing a book inherits subjectivity of author side. The book size is proportional to the indexing effort and consistency. This leads to the fact that page numbers are not always referred to relevant pages. This paper proposes an approach to identify relevancy of a page that contains an indexed term. This approach measures the semantic relation between indexed term with the respective sentence in the page. To measure the semantic relation, the approach utilizes semantic distance algorithm that based on Wordnet thesaurus. We measure the reliability of our system by measuring its degree of agreement with the book indexer using kappa statistics. The experimental result shows that the proposed approach are considered as good as the domain expert, given average kappa value 0.6034.
引用
收藏
页码:183 / 192
页数:10
相关论文
共 41 条
  • [21] Evaluating Question generation models using QA systems and Semantic Textual Similarity
    Shaheer, Safwan
    Hossain, Ishmam
    Sarna, Sudipta Nandi
    Mehedi, Md Humaion Kabir
    Rasel, Annajiat Alim
    2023 IEEE 13TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE, CCWC, 2023, : 431 - 435
  • [22] Deep learning based Bengali question answering system using semantic textual similarity
    Das, Arijit
    Saha, Diganta
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (01) : 589 - 613
  • [23] Using Textual Semantic Similarity to Improve Clustering Quality of Web Video Search Results
    Phuc Quang Nguyen
    Anh-Thu Nguyen-Thi
    Thanh Duc Ngo
    Tu-Anh Hoang Nguyen
    2015 SEVENTH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE), 2015, : 156 - 161
  • [24] Deep learning based Bengali question answering system using semantic textual similarity
    Arijit Das
    Diganta Saha
    Multimedia Tools and Applications, 2022, 81 : 589 - 613
  • [25] Sentence-Level Semantic Textual Similarity Using Word-Level Semantics
    Shajalal, Md
    Aono, Masaki
    2018 10TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (ICECE), 2018, : 113 - 116
  • [26] Evaluating Semantic Textual Similarity in Clinical Sentences Using Deep Learning and Sentence Embeddings
    Antunes, Rui
    Silva, Joao Figueira
    Matos, Sergio
    PROCEEDINGS OF THE 35TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING (SAC'20), 2020, : 662 - 669
  • [27] SEMGROMI-a semantic grouping algorithm to identifying microservices using semantic similarity of user stories
    Vera-Rivera, Fredy H.
    Cuadros, Eduard Gilberto Puerto
    Perez, Boris
    Astudillo, Hernan
    Gaona, Carlos
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [28] Predicting learning performance using NLP: an exploratory study using two semantic textual similarity methods
    Papadimas, C.
    Ragazou, V.
    Karasavvidis, I.
    Kollias, V.
    KNOWLEDGE AND INFORMATION SYSTEMS, 2025, : 4567 - 4595
  • [29] Identifying Protein Complexes from PPI Networks Using GO Semantic Similarity
    Wang, Jian
    Xie, Dong
    Lin, Hongfei
    Yang, Zhihao
    Zhang, Yijia
    2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM 2011), 2011, : 582 - 585
  • [30] An optimized approach for massive web page classification using entity similarity based on semantic network
    Li, Huakang
    Xu, Zheng
    Li, Tao
    Sun, Guozi
    Choo, Kim-Kwang Raymond
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2017, 76 : 510 - 518