Using Semantic Similarity for Identifying Relevant Page Numbers for Indexed Term of Textual Book

被引:0
|
作者
Siahaan, Daniel [1 ]
Christina, Sherly [2 ]
机构
[1] Inst Teknol Sepuluh Nopember, Dept Informat, Surabaya, Indonesia
[2] Univ Palangkaraya, Dept Informat, Palangkaraya, Indonesia
关键词
book indexing; back-of-book index; relevant page number; semantic relation; AGREEMENT;
D O I
10.1007/978-3-662-46742-8_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Back-of-book index page is one of navigation tools for reader. It helps reader to immediately jump to a page that contains relevant information regarding a specific term. It helps reader to retrieve information about specific topics in mind without having to read the complete book. Indexed terms are usually determined by author based on one's subjective preference on what indications should be used to decide whether a term should be indexed and what pages are relevant. Therefore, indexing a book inherits subjectivity of author side. The book size is proportional to the indexing effort and consistency. This leads to the fact that page numbers are not always referred to relevant pages. This paper proposes an approach to identify relevancy of a page that contains an indexed term. This approach measures the semantic relation between indexed term with the respective sentence in the page. To measure the semantic relation, the approach utilizes semantic distance algorithm that based on Wordnet thesaurus. We measure the reliability of our system by measuring its degree of agreement with the book indexer using kappa statistics. The experimental result shows that the proposed approach are considered as good as the domain expert, given average kappa value 0.6034.
引用
收藏
页码:183 / 192
页数:10
相关论文
共 41 条
  • [1] Efficient Textual Similarity using Semantic MinHashing
    Nawaz, Waqas
    Baig, Maryam
    Khan, Kifayat Ullah
    2024 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING, IEEE BIGCOMP 2024, 2024, : 262 - 269
  • [2] Semantic Textual Similarity Using Various Approaches
    Kazula, Maciej
    Kozlowski, Marek
    MACHINE INTELLIGENCE AND BIG DATA IN INDUSTRY, 2016, 19 : 49 - 62
  • [3] Linking Datasets Using Semantic Textual Similarity
    McCrae, John P.
    Buitelaar, Paul
    CYBERNETICS AND INFORMATION TECHNOLOGIES, 2018, 18 (01) : 109 - 123
  • [4] Question Similarity Detection in Turkish Using Semantic Textual Similarity Methods
    Yildiz, Eray
    Findik, Yasin
    2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
  • [5] Calculation of Textual Similarity Using Semantic Relatedness Functions
    Kairaldeen, Ammar Riadh
    Ercan, Gonenc
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2015), PT II, 2015, 9042 : 516 - 524
  • [6] Multilingual Semantic Textual Similarity using Multilingual Word Representations
    Ahmed, Mahtab
    Dixit, Chahna
    Mercer, Robert E.
    Khan, Atif
    Samee, Muhammad Rifayat
    Urra, Felipe
    2020 IEEE 14TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2020), 2020, : 194 - 198
  • [7] Semantic textual similarity between sentences using bilingual word semantics
    Shajalal, Md
    Aono, Masaki
    PROGRESS IN ARTIFICIAL INTELLIGENCE, 2019, 8 (02) : 263 - 272
  • [8] Learning to Rank Hypernyms of Financial Terms Using Semantic Textual Similarity
    Ghosh S.
    Chopra A.
    Naskar S.K.
    SN Computer Science, 4 (5)
  • [9] Identify the Relevant Pages of Book to be Indexed Using Naive Bayes Classification Method
    Christina, S.
    Ronaldo, D.
    3RD INTERNATIONAL CONFERENCE ON ENGINEERING TECHNOLOGY FOR SUSTAINABLE DEVELOPMENT (ICET4SD), 2020, 722
  • [10] Semantic Textual Similarity in Japanese Clinical Domain Texts Using BERT
    Mutinda, Faith Wavinya
    Yada, Shuntaro
    Wakamiya, Shoko
    Aramaki, Eiji
    METHODS OF INFORMATION IN MEDICINE, 2021, 60 : E56 - E64