How to Improve E-commerce Search Engines? Evaluating Transformer-Based Named Entity Recognition on German Product Datasets

被引:0
|
作者
Denisov, Sergej [1 ]
Baumer, Frederik S. [1 ]
机构
[1] Bielefeld Univ Appl Sci, Bielefeld, Germany
关键词
Transformer; Named entity recognition; E-commerce;
D O I
10.1007/978-3-030-88304-1_28
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The quality of e-commerce search engines often suffers from data that online retailers poorly maintain. This situation can be observed on consumer-to-consumer marketplaces as well as on business-to-consumer platforms. One way to improve search quality is to perform linguistic enhancement of the product data. In this case, Named Entity Recognition is primarily used to identify important content and give it a higher weighting in the search. Our approach detects e-commerce entity types, such as products, brands, and various product attributes. Because of the low availability of existing resources and linguistic complexity identifying these entity types is challenging. Therefore, we acquire data from two online e-commerce marketplaces to build six German datasets based on product titles and descriptions. For these datasets, we evaluate the NER performance of the state-of-the-art models BERT, RoBERTa, and XLM-RoBERTa. The best performance archived the XLM-RoBERTa model with an F1 score of 0.8611 averaged over all datasets.
引用
收藏
页码:353 / 366
页数:14
相关论文
共 42 条
  • [21] Extracting social determinants of health events with transformer-based multitask, multilabel named entity recognition
    Richie, Russell
    Ruiz, Victor M.
    Han, Sifei
    Shi, Lingyun
    Tsui, Fuchiang
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2023, 30 (08) : 1379 - 1388
  • [22] A Weighted Flat Lattice Transformer-based Knowledge Extraction Architecture for Chinese Named Entity Recognition
    Zhang, Hengwei
    Wu, Yuejia
    Zhou, Jian-tao
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 193 - 198
  • [23] Graph-based Multilingual Product Retrieval in E-Commerce Search
    Lu, Hanqing
    Hu, Youna
    Zhao, Tong
    Wu, Tony
    Song, Yiwei
    Yin, Bing
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, NAACL-HLT 2021, 2021, : 146 - 153
  • [24] Building Large-Scale Deep Learning System for Entity Recognition in E-Commerce Search
    Wen, Musen
    Vasthimal, Deepak Kumar
    Lu, Alan
    Wang, Tian
    Guo, Aimin
    BDCAT'19: PROCEEDINGS OF THE 6TH IEEE/ACM INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING, APPLICATIONS AND TECHNOLOGIES, 2019, : 149 - 154
  • [25] T-NER: An All-Round Python']Python Library for Transformer-based Named Entity Recognition
    Ushio, Asahi
    Camacho-Collados, Jose
    EACL 2021: THE 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: PROCEEDINGS OF THE SYSTEM DEMONSTRATIONS, 2021, : 53 - 62
  • [26] Clustering e-commerce search engines based on their search interface pages using WISE-Cluster
    Lu, Yiyao
    He, Hai
    Peng, Qian
    Meng, Weiyi
    Yu, Clement
    DATA & KNOWLEDGE ENGINEERING, 2006, 59 (02) : 231 - 246
  • [27] Title-Based Product Search - Exemplified in a Chinese E-commerce Portal
    Chen, Chien-Wen
    Cheng, Pu-Jen
    INFORMATION RETRIEVAL TECHNOLOGY, 2010, 6458 : 25 - 36
  • [28] Deep Learning Based Sentiment Aware Ranking for E-commerce Product Search
    Jbene, Mourad
    Tigani, Smail
    ADVANCED INTELLIGENT SYSTEMS FOR SUSTAINABLE DEVELOPMENT (AI2SD'2020), VOL 2, 2022, 1418 : 87 - 97
  • [29] Unsupervised Product Title Optimization Based on Search Behavior Knowledge in E-commerce
    Liu, Shu
    Ye, Zhiqiang
    Liao, Jian
    Wu, Jinxin
    Li, Zhao
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [30] Encountering Product Information: How Flashes of Insight Improve Your Decisions on E-Commerce Platforms
    Wang, Lu
    Zhang, Guangling
    Jiang, Dan
    JOURNAL OF THEORETICAL AND APPLIED ELECTRONIC COMMERCE RESEARCH, 2024, 19 (03): : 2180 - 2197