Information Extraction Based on Table Area Locating for E-Commerce, Websites

被引:0
|
作者
Ouyang, Liubo [1 ]
Dong, Rui [1 ]
Zou, Beiji [2 ]
机构
[1] Hunan Univ, Software Sch, Changsha 410082, Hunan, Peoples R China
[2] Cent S Univ, Sch Informat Sci & Engn, Changsha 410083, Peoples R China
关键词
Web Tables; DOM tree; Area location; Information extraction;
D O I
10.1109/GCIS.2009.310
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Efficient extracting merchandise information is the Key technology for e-commerce searching engine. B-v analyzing web table characters of HTML pages of e-commerce websites, this article proposes the notion of table area locating, and decomposes the merchandise information extraction into three key processes searching Preparative Core Areas (PCA), locating Core Area (CA) and extracting attribute values from Core-Area, and then design the algorithm of locating Core Area and the algorithm of extracting attributes names and values. We experimented with the new approach on some HTML pages from various e-commerce websites. The results indicate that this approach can locate merchandise information area and extract attributes names and values efficiently, and have better performance of precise and recall.
引用
收藏
页码:441 / +
页数:2
相关论文
共 50 条
  • [1] An information architecture-based evaluating model for e-commerce websites
    Shen, Bo
    Xu, Shenghua
    Fifth Wuhan International Conference on E-Business, Vols 1-3: INTEGRATION AND INNOVATION THROUGH MEASUREMENT AND MANAGEMENT, 2006, : 747 - 751
  • [2] Domain dependent product feature and opinion extraction based on E-commerce websites
    Twardowski, B. (B.Twardowski@ii.pw.edu.pl), 1600, Springer Verlag (183 AISC):
  • [3] Valuation of Personal Information in the E-commerce Websites based on Contingent Valuation Method
    Huang, Yijun
    Lu, Tong
    PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON E-BUSINESS, MANAGEMENT AND ECONOMICS (ICEME 2017), 2015, : 21 - 27
  • [4] A Culture-based Study on Information Density of Mobile E-commerce Websites
    Chu, Junjie
    Li, Min
    2012 FIFTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2012), VOL 2, 2012, : 266 - 269
  • [5] Analysis of Trust Presence Within E-Commerce Websites: A Study of Indonesian E-Commerce Websites
    Shihab, Muhammad R.
    Wahyuni, Sri
    Hidayanto, A. N.
    2014 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS), 2014, : 133 - 138
  • [6] Analysis of the Present Applications of Information Visualization in E-commerce Websites
    Zhao, Dan
    Zhou, Ning
    2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 9331 - 9334
  • [7] A Modified WASPAS Method for the Evaluation of E-Commerce Websites Based on Pythagorean Fuzzy Information
    Ou, Xiufang
    Chen, Bingbin
    IEEE ACCESS, 2025, 13 : 9303 - 9312
  • [8] An efficient mechanism for product data extraction from e-commerce websites
    Akhtar, Malik Javed
    Ahmad, Zahur
    Amin, Rashid
    Almotiri, Sultan H.
    Al Ghamdi, Mohammed A.
    Aldabbas, Hamza
    Computers, Materials and Continua, 2020, 65 (03): : 2639 - 2663
  • [9] An Efficient Mechanism for Product Data Extraction from E-Commerce Websites
    Akhtar, Malik Javed
    Ahmad, Zahur
    Amin, Rashid
    Almotiri, Sultan H.
    Al Ghamdi, Mohammed A.
    Aldabbas, Hamza
    CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 65 (03): : 2639 - 2663
  • [10] The information management and organization applied in tourism e-commerce based on the role of websites in China
    Wei, Min
    Sixth Wuhan International Conference on E-Business, Vols 1-4: MANAGEMENT CHALLENGES IN A GLOBAL WORLD, 2007, : 909 - 914