Dark Web Text Classification by Learning through SVM Optimization

被引:5
|
作者
Murty, Ch A. S. [1 ]
Rughani, Parag H. [2 ]
机构
[1] Ctr Dev Adv Comp C DAC, Hyderabad, India
[2] Natl Forens Sci Univ, Digital Forens, Gandhinagar, Gujarat, India
关键词
Darkweb; SVM; classification; Darkweb content classification;
D O I
10.12720/jait.13.6.624-631
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Darkweb has become the largest repository of unauthorized information compared to the surface web because of its benefit of anonymity and privacy. With these anonymity and privacy features, the dark web is also becoming a safe place for illegal activities and hence an increase of dark web usage and size of the onion-based URLs. With the increasing use of dark web users, it is the need for cybercrime investigators across the globe to classify dark web data for understanding various illegal activities to control and categorize URLs hosting such illicit activities with feature engineering. In this research, the Support Vector Machines (SVM) algorithm is used to understand the algorithm's efficiency for a proposed model to classify dark web data with optimization techniques. Text-based keywords from more than 1800 websites were collected by applying feature engineering techniques and the system's performance was evaluated with the SVM approach. The results are very encouraging as the Precision, Recall, and F-measure values are 0.83, 0.90 & 0.96 achieved with a dataset of 1800 URLs.
引用
收藏
页码:624 / 631
页数:8
相关论文
共 50 条
  • [31] Comparison of SVM and some older classification algorithms in text classification tasks
    Colas, Fabrice
    Brazdil, Pavel
    ARTIFICIAL INTELLIGENCE IN THEORY AND PRACTICE, 2006, 217 : 169 - +
  • [32] Classification of Illegal Activities on the Dark Web
    He, Siyu
    He, Yongzhong
    Li, Mingzhe
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND SYSTEMS (ICISS 2019), 2019, : 73 - 78
  • [33] Understanding the Impact of Label Skewness and Optimization on Federated Learning for Text Classification
    Francis, Sumam
    Uma, Kanimozhi
    Moens, Marie-Francine
    COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023, 2023, : 1161 - 1166
  • [34] Web filtering using text classification
    Du, RB
    Safavi-Naini, R
    Susilo, W
    ICON 2003: 11TH IEEE INTERNATIONAL CONFERENCE ON NETWORKS, 2003, : 325 - 330
  • [35] Text classification for Chinese web documents
    Hu, Ming
    Xu, Jianchao
    Hu, Liang
    COMPUTATIONAL METHODS, PTS 1 AND 2, 2006, : 1171 - +
  • [36] Improving text classification by web corpora
    Guzman, Rafael
    Montes, Manuel
    Rosso, Paolo
    Villasenor, Luis
    ADVANCES IN INTELLIGENT WEB MASTERING, 2007, 43 : 154 - +
  • [37] The Instructional Design of Chinese Text Classification based on SVM
    Wei, Sichao
    Guo, Jianyi
    Yu, Zhengtao
    Chen, Peng
    Xian, Yantuan
    2013 25TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2013, : 5114 - 5117
  • [38] Research on Text Classification Based on SVM-KNN
    Lin, Yun
    Wang, Jie
    2014 5TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2014, : 842 - 844
  • [39] SVM Algorithm in the Application of Educational Resources Text Classification
    Wang, Xuejun
    Hu, Xuewei
    Liu, Dan
    2011 3RD WORLD CONGRESS IN APPLIED COMPUTING, COMPUTER SCIENCE, AND COMPUTER ENGINEERING (ACC 2011), VOL 1, 2011, 1 : 66 - 70
  • [40] An Efficient Hindi Text Classification Model Using SVM
    Puri, Shalini
    Singh, Satya Prakash
    COMPUTING AND NETWORK SUSTAINABILITY, 2019, 75