Dark Web: E-Commerce Information Extraction Based on Name Entity Recognition Using Bidirectional-LSTM

被引:5
|
作者
Shah, Syed Afeef Ahmed [1 ]
Masood, Muhammad Ali [1 ]
Yasin, Amanullah [1 ]
机构
[1] Air Univ, Dept Creat Technol, Islamabad 44000, Pakistan
来源
IEEE ACCESS | 2022年 / 10卷
关键词
Electronic commerce; Crawlers; Data mining; Task analysis; Deep learning; Information retrieval; Training data; Natural language processing; Convolutional neural networks; Dark Web; Name Entity Recognition; natural language processing; bidirectional LSTM; convolutional neural network; word embedding; !text type='HTML']HTML[!/text; e-commerce; dark-web; entities detection; marketplace;
D O I
10.1109/ACCESS.2022.3206539
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Information extraction from e-commerce platform is a challenging task. Due to significant increase in number of ecommerce marketplaces, it is difficult to gain good accuracy by using existing data mining techniques to systematically extract key information. The first step toward recognizing e-commerce entities is to design an application that detects the entities from unstructured text, known as the Named Entity Recognition (NER) application. The previous NER solutions are specific for recognizing entities such as people, locations, and organizations in raw text, but they are limited in e-commerce domain. We proposed a Bi-directional LSTM with CNN model for detecting e-commerce entities. The proposed model represents rich and complex knowledge about entities and groups of entities about products sold on the dark web. Different experiments were conducted to compare state-of-the-art baselines. Our proposed approach achieves the best performance accuracy on the Dark Web dataset and Conll-2003. Results show good accuracy of 96.20% and 92.90% for the Dark Web dataset and the Conll-2003 dataset, which show good performance compared to other cutting-edge approaches.
引用
收藏
页码:99633 / 99645
页数:13
相关论文
共 28 条