Using Semantic Similarity for Identifying Relevant Page Numbers for Indexed Term of Textual Book

被引:0
|
作者
Siahaan, Daniel [1 ]
Christina, Sherly [2 ]
机构
[1] Inst Teknol Sepuluh Nopember, Dept Informat, Surabaya, Indonesia
[2] Univ Palangkaraya, Dept Informat, Palangkaraya, Indonesia
关键词
book indexing; back-of-book index; relevant page number; semantic relation; AGREEMENT;
D O I
10.1007/978-3-662-46742-8_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Back-of-book index page is one of navigation tools for reader. It helps reader to immediately jump to a page that contains relevant information regarding a specific term. It helps reader to retrieve information about specific topics in mind without having to read the complete book. Indexed terms are usually determined by author based on one's subjective preference on what indications should be used to decide whether a term should be indexed and what pages are relevant. Therefore, indexing a book inherits subjectivity of author side. The book size is proportional to the indexing effort and consistency. This leads to the fact that page numbers are not always referred to relevant pages. This paper proposes an approach to identify relevancy of a page that contains an indexed term. This approach measures the semantic relation between indexed term with the respective sentence in the page. To measure the semantic relation, the approach utilizes semantic distance algorithm that based on Wordnet thesaurus. We measure the reliability of our system by measuring its degree of agreement with the book indexer using kappa statistics. The experimental result shows that the proposed approach are considered as good as the domain expert, given average kappa value 0.6034.
引用
收藏
页码:183 / 192
页数:10
相关论文
共 41 条
  • [31] Identifying High-Priority Proteins Across the Human Diseasome Using Semantic Similarity
    Lau, Edward
    Venkatraman, Vidya
    Thomas, Cody T.
    Wu, Joseph C.
    Van Eyk, Jennifer E.
    Lam, Maggie P. Y.
    JOURNAL OF PROTEOME RESEARCH, 2018, 17 (12) : 4267 - 4278
  • [32] Automatic Bangla Text Summarization Using Term Frequency and Semantic Similarity Approach
    Sarkar, Avik
    Hossen, Md Sharif
    2018 21ST INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2018,
  • [33] Semantic enrichment for BIM-based building energy performance simulations using semantic textual similarity and fine-tuning multilingual LLM
    Forth, Kasimir
    Borrmann, Andre
    JOURNAL OF BUILDING ENGINEERING, 2024, 95
  • [34] Using semantic and phonetic term similarity for spoken document retrieval and spoken query processing
    Crestani, F
    TECHNOLOGIES FOR CONSTRUCTING INTELLIGENT SYSTEMS 1: TASKS, 2002, 89 : 363 - 375
  • [35] Identifying most relevant controls on catchment hydrological similarity using model transferability-A comprehensive study in Iran
    Jahanshahi, Afshin
    Patil, Sopan D.
    Goharian, Erfan
    JOURNAL OF HYDROLOGY, 2022, 612
  • [36] Implementation of Semantic Textual Similarity between Requirement Specification and Use Case Description Using WUP Method (Case Study: Sipjabs Application)
    Sari, Elsa Jelista
    Priyadi, Yudi
    Riskiana, Rosa Reska
    2022 IEEE WORLD AI IOT CONGRESS (AIIOT), 2022, : 681 - 687
  • [37] Incorporating Domain Knowledge Into Language Models by Using Graph Convolutional Networks for Assessing Semantic Textual Similarity: Model Development and Performance Comparison
    Chang, David
    Lin, Eric
    Brandt, Cynthia
    Taylor, Richard Andrew
    JMIR MEDICAL INFORMATICS, 2021, 9 (11)
  • [38] IDENTIFICATION OF DROUGHT-INDUCED TRANSCRIPTION FACTORS IN Sorghum bicolor USING GO TERM SEMANTIC SIMILARITY
    Sekhwal, Manoj Kumar
    Swami, Ajit Kumar
    Sharma, Vinay
    Sarin, Renu
    CELLULAR & MOLECULAR BIOLOGY LETTERS, 2015, 20 (01) : 1 - 23
  • [39] PhenClust, a standalone tool for identifying trends within sets of biological phenotypes using semantic similarity and the Unified Medical Language System metathesaurus
    Wilson, Jennifer L.
    Wong, Mike
    Stepanov, Nicholas
    Petkovic, Dragutin
    Altman, Russ
    JAMIA OPEN, 2021, 4 (03)
  • [40] Experiments with Document Retrieval from Small Text Collections Using Latent Semantic Analysis or Term Similarity with Query Coordination and Automatic Relevance Feedback
    Layfield, Colin
    Azzopardi, Joel
    Staff, Chris
    SEMANTIC KEYWORD-BASED SEARCH ON STRUCTURED DATA SOURCES, IKC 2016, 2017, 10151 : 25 - 36