A VISUAL WEBPAGE INFORMATION EXTRACTION FRAMEWORK FOR COMPETITIVE INTELLIGENCE SYSTEM

被引:0
|
作者
Zhang, Zhiwei [1 ]
Qin, Wenbo [1 ]
Xu, Haifeng [1 ]
机构
[1] Suzhou Univ, Sch Informat & Engn, Suzhou, Peoples R China
来源
SCALABLE COMPUTING-PRACTICE AND EXPERIENCE | 2024年 / 25卷 / 05期
关键词
Information extraction visualization; natural language processing; competitive intelligence; data mining;
D O I
10.12694/scpe.v25i5.3078
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The extraction of webpage information is of paramount importance in the realm of competitive intelligence. This research is dedicated to the design and implementation of a visual webpage information extraction module within a competitive intelligence system, approached through the lens of research and development (R&D) technology and its practical applications. Initially, the study delineates the objectives and requirements for webpage information extraction, emphasizing the practical needs of competitive intelligence systems. By critically assessing the strengths and weaknesses of current theories and methodologies in webpage text information extraction, this paper introduces an innovative visual method for extracting webpage text information. Subsequently, the paper meticulously outlines the comprehensive architecture of the proposed module. Building upon this foundation, the study delves into the specifics of the extraction template, rule generation, optimization techniques, and the extraction algorithm pivotal to the process of visual webpage information extraction. The system's effectiveness and practical utility are substantiated through a series of confirmatory experiments, the results of which are thoroughly analyzed. The findings affirm that the developed system adeptly fulfills the webpage information extraction needs of competitive intelligence systems, contributing significantly to the R&D efforts in which the authors are engaged.
引用
收藏
页码:4138 / 4152
页数:15
相关论文
共 50 条
  • [1] CoVA: Context-aware Visual Attention for Webpage Information Extraction
    Kumar, Anurendra
    Morabia, Keval
    Wang, Jingjin
    Chang, Kevin Chen-Chuan
    Schwing, Alexander
    PROCEEDINGS OF THE 5TH WORKSHOP ON E-COMMERCE AND NLP (ECNLP 5), 2022, : 80 - 90
  • [2] Information extraction and webpage understanding
    Sharmila Begum, M.
    Dinesh, L.
    Aruna, P.
    International Journal of Computer Science Issues, 2011, 8 (6 6-3): : 304 - 308
  • [3] Research on Intelligent Extraction of Webpage Information
    Li, Likun
    2014 2ND INTERNATIONAL CONFERENCE IN HUMANITIES, SOCIAL SCIENCES AND GLOBAL BUSINESS MANAGEMENT (ISSGBM 2014), VOL 29, 2014, 29 : 213 - 217
  • [4] WEBPAGE SEGMENTATION USING VISUAL INFORMATION
    Hong, Jer Lang
    Tan, Wan Cyhn
    UNCERTAINTY MODELLING IN KNOWLEDGE ENGINEERING AND DECISION MAKING, 2016, 10 : 398 - 404
  • [5] A Framework for Information Systems Innovation: A Case of Competitive Intelligence in Organisations
    Nemutanzhela, Phathutshedzo
    Iyamu, Tiko
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON INFORMATION MANAGEMENT AND EVALUATION, 2011, : 341 - 350
  • [6] Interpreting, analyzing and distributing information: A big data framework for competitive intelligence
    Casarotto, Eduardo Luis
    Malafaia, Guilherme Cunha
    Martinez, Marta Pagan
    Binotto, Erlaine
    JOURNAL OF INTELLIGENCE STUDIES IN BUSINESS, 2021, 11 (01): : 6 - 18
  • [7] Relation Extraction for Competitive Intelligence
    Collovini, Sandra
    Goncalves, Patricia Nunes
    Cavalheiro, Guilherme
    Santos, Joaquim
    Vieira, Renata
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2020, 2020, 12037 : 249 - 258
  • [8] Business intelligence in the logistic domain using visual information extraction
    Prado, Jose Aldo Diaz
    INTERNET & INFORMATION SYSTEMS IN THE DIGITAL AGE: CHALLENGES AND SOLUTIONS, 2006, : 700 - +
  • [9] An FAR-SW based approach for webpage information extraction
    Bu, Zhan
    Zhang, Chengcui
    Xia, Zhengyou
    Wang, Jiandong
    INFORMATION SYSTEMS FRONTIERS, 2014, 16 (05) : 771 - 785
  • [10] Competitive intelligence for information professionals
    McLean, Michelle
    AUSTRALIAN LIBRARY JOURNAL, 2016, 65 (01): : 60 - 61