How to Improve E-commerce Search Engines? Evaluating Transformer-Based Named Entity Recognition on German Product Datasets

被引:0
|
作者
Denisov, Sergej [1 ]
Baumer, Frederik S. [1 ]
机构
[1] Bielefeld Univ Appl Sci, Bielefeld, Germany
关键词
Transformer; Named entity recognition; E-commerce;
D O I
10.1007/978-3-030-88304-1_28
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The quality of e-commerce search engines often suffers from data that online retailers poorly maintain. This situation can be observed on consumer-to-consumer marketplaces as well as on business-to-consumer platforms. One way to improve search quality is to perform linguistic enhancement of the product data. In this case, Named Entity Recognition is primarily used to identify important content and give it a higher weighting in the search. Our approach detects e-commerce entity types, such as products, brands, and various product attributes. Because of the low availability of existing resources and linguistic complexity identifying these entity types is challenging. Therefore, we acquire data from two online e-commerce marketplaces to build six German datasets based on product titles and descriptions. For these datasets, we evaluate the NER performance of the state-of-the-art models BERT, RoBERTa, and XLM-RoBERTa. The best performance archived the XLM-RoBERTa model with an F1 score of 0.8611 averaged over all datasets.
引用
收藏
页码:353 / 366
页数:14
相关论文
共 42 条