Ontology Extraction Considering Content Concordance from Tagging to Web Pages in Similar SBM Users

被引:0
|
作者
Harada, Fumiko [1 ]
Shimakawa, Hiromitsu [1 ]
机构
[1] Ritsumeikan Univ, Fac Comp Sci, Dept Informat Sci & Engn, Kusatsu, Shiga 5258577, Japan
关键词
personal phrase meaning; tagging; social bookmark; similar user;
D O I
10.1109/IIAI-AAI.2013.45
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To realize web search engines with considering meaning of query phrases for each user, we have studied a method to extract hierarchical and synonymous relationships among tagged phrases on a social bookmark (SBM) for an individual SBM user. It detects the relationships from webpage clusters with same tagged phrases derived from the bookmarks shared in the target and his similar SBM users. However, noisy tagging violating personal phrase meaning degrades its detection accuracy. This paper proposes a method to improve such drawback. The proposed method classifies webpages based on its content concordance as long as based on sameness of tagged phrases. Analyzing webpages belongingness to content-based and tag-based clusters, the relationships are detected more accurately. We compared the detection accuracies of the proposed and traditional methods through an experiment. For hierarchical relationships, the F-measure improves by 7.41% and the precision improves by 20.94% under guaranteeing more than 20% recall. For synonymous one, the F-measure does by 4.17% and the precision does by 21.80% under more than 10% recall.
引用
收藏
页码:289 / 295
页数:7
相关论文
共 50 条
  • [21] Content Information Extraction of Theme Web Pages based on Tag Information
    Wang, Jie
    Wu, Jian
    Zhang, Yafeng
    He, Guowan
    2014 SEVENTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2014), VOL 1, 2014, : 501 - 504
  • [22] Opinion Content Extraction from Web Pages Using Embedded Semantic Term Tree Kernels
    Pagi, Veerappa B.
    Wadawadagi, Ramesh S.
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA ENGINEERING, 2018, 9 : 345 - 358
  • [23] WEB2ONTO: Automatic Ontology Construction Approach from Web pages
    Elmesalmy, Naglaa
    Hadhoud, Mayada
    Fayeka, Magda
    2019 15TH INTERNATIONAL COMPUTER ENGINEERING CONFERENCE (ICENCO 2019), 2019, : 175 - 182
  • [24] Turkish Keyphrase Extraction from Web Pages with BERT
    Ayan, Emre Tolga
    Arslan, Rabia
    Zengin, Muhammed Said
    Duru, Haci Ali
    Salman, Sedat
    Bardak, Batuhan
    29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
  • [25] Structrued and semantic data extraction from Web pages
    Gan, Y
    Zhang, SZ
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 2930 - 2935
  • [26] Extraction of web news from web pages using a ternary tree approach
    Laishram, Debina
    Sebastian, Merin
    2015 SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING AND COMMUNICATION ENGINEERING ICACCE 2015, 2015, : 628 - 633
  • [27] Effectual Web Content Mining using Noise Removal from Web Pages
    P. Sivakumar
    Wireless Personal Communications, 2015, 84 : 99 - 121
  • [28] Extracting Topic Maps from Web Pages by Web Link Structure and Content
    Mase, Motohiro
    Yamada, Seiji
    Nitta, Katsumi
    2008 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-8, 2008, : 1232 - +
  • [29] Effectual Web Content Mining using Noise Removal from Web Pages
    Sivakumar, P.
    WIRELESS PERSONAL COMMUNICATIONS, 2015, 84 (01) : 99 - 121
  • [30] Person Attribute Extraction from the Textual Parts of Web Pages
    Istvan, Nagy T.
    ACTA CYBERNETICA, 2012, 20 (03): : 419 - 440