How to Improve E-commerce Search Engines? Evaluating Transformer-Based Named Entity Recognition on German Product Datasets

被引:0
|
作者
Denisov, Sergej [1 ]
Baumer, Frederik S. [1 ]
机构
[1] Bielefeld Univ Appl Sci, Bielefeld, Germany
关键词
Transformer; Named entity recognition; E-commerce;
D O I
10.1007/978-3-030-88304-1_28
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The quality of e-commerce search engines often suffers from data that online retailers poorly maintain. This situation can be observed on consumer-to-consumer marketplaces as well as on business-to-consumer platforms. One way to improve search quality is to perform linguistic enhancement of the product data. In this case, Named Entity Recognition is primarily used to identify important content and give it a higher weighting in the search. Our approach detects e-commerce entity types, such as products, brands, and various product attributes. Because of the low availability of existing resources and linguistic complexity identifying these entity types is challenging. Therefore, we acquire data from two online e-commerce marketplaces to build six German datasets based on product titles and descriptions. For these datasets, we evaluate the NER performance of the state-of-the-art models BERT, RoBERTa, and XLM-RoBERTa. The best performance archived the XLM-RoBERTa model with an F1 score of 0.8611 averaged over all datasets.
引用
收藏
页码:353 / 366
页数:14
相关论文
共 42 条
  • [31] Transformer-based Named Entity Recognition for Clinical Cancer Drug Toxicity by Positive-unlabeled Learning and KL Regularizers
    Xie, Weixin
    Xu, Jiayu
    Zhao, Chengkui
    Li, Jin
    Han, Shuangze
    Shao, Tianyu
    Wang, Limei
    Feng, Weixing
    CURRENT BIOINFORMATICS, 2024, 19 (08) : 738 - 751
  • [32] Deep Learning-based Semantic Search Techniques for Enhancing Product Matching in E-commerce
    Aamir, Fatima
    Sherafgan, Raheimeen
    Arbab, Tehreem
    Jamil, Akhtar
    Bhatti, Fazeel Nadeem
    Hameed, Alaa Ali
    2024 IEEE 3RD INTERNATIONAL CONFERENCE ON COMPUTING AND MACHINE INTELLIGENCE, ICMI 2024, 2024,
  • [33] Deep Neural Network and Boosting Based Hybrid Quality Ranking for e-Commerce Product Search
    Jbene, Mourad
    Tigani, Smail
    Saadane, Rachid
    Chehri, Abdellah
    BIG DATA AND COGNITIVE COMPUTING, 2021, 5 (03)
  • [34] A Product Feature-Based User-Centric Ranking Model for E-Commerce Search
    Ben Jabeur, Lamjed
    Soulier, Laure
    Tamine, Lynda
    Mousset, Paul
    EXPERIMENTAL IR MEETS MULTILINGUALITY, MULTIMODALITY, AND INTERACTION, CLEF 2016, 2016, 9822 : 174 - 186
  • [35] Entity name recognition of cross-border e-commerce commodity titles based on TWs-LSTM
    Luo, Yongcong
    Ma, Jing
    Li, Chi
    ELECTRONIC COMMERCE RESEARCH, 2020, 20 (02) : 405 - 426
  • [36] Dark Web: E-Commerce Information Extraction Based on Name Entity Recognition Using Bidirectional-LSTM
    Shah, Syed Afeef Ahmed
    Masood, Muhammad Ali
    Yasin, Amanullah
    IEEE ACCESS, 2022, 10 : 99633 - 99645
  • [37] Entity name recognition of cross-border e-commerce commodity titles based on TWs-LSTM
    Yongcong Luo
    Jing Ma
    Chi Li
    Electronic Commerce Research, 2020, 20 : 405 - 426
  • [38] Machine Learning Based Cross-border E-Commerce Commodity Customs Product Name Recognition Algorithm
    Ma, Jing
    Li, Xiaofeng
    Li, Chi
    He, Bo
    Guo, Xiaoyu
    PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2019, 11672 : 247 - 256
  • [39] Named Entity Recognition and Relation Extraction for COVID-19: Explainable Active Learning with Word2vec Embeddings and Transformer-Based BERT Models
    Arguello-Casteleiro, M.
    Maroto, N.
    Wroe, C.
    Torrado, C. Sevillano
    Henson, C.
    Des-Diz, J.
    Fernandez-Prieto, M. J.
    Furmston, T.
    Fernandez, D. Maseda
    Kulshrestha, M.
    Stevens, R.
    Keane, J.
    Peters, S.
    ARTIFICIAL INTELLIGENCE XXXVIII, 2021, 13101 : 158 - 163
  • [40] Application of big data search based on collaborative filtering algorithm in cross-border e-commerce product recommendation
    Wu, Xiaoli
    Wu, Zhihao
    SOFT COMPUTING, 2023,