Dark Web Text Classification by Learning through SVM Optimization

被引:5
|
作者
Murty, Ch A. S. [1 ]
Rughani, Parag H. [2 ]
机构
[1] Ctr Dev Adv Comp C DAC, Hyderabad, India
[2] Natl Forens Sci Univ, Digital Forens, Gandhinagar, Gujarat, India
关键词
Darkweb; SVM; classification; Darkweb content classification;
D O I
10.12720/jait.13.6.624-631
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Darkweb has become the largest repository of unauthorized information compared to the surface web because of its benefit of anonymity and privacy. With these anonymity and privacy features, the dark web is also becoming a safe place for illegal activities and hence an increase of dark web usage and size of the onion-based URLs. With the increasing use of dark web users, it is the need for cybercrime investigators across the globe to classify dark web data for understanding various illegal activities to control and categorize URLs hosting such illicit activities with feature engineering. In this research, the Support Vector Machines (SVM) algorithm is used to understand the algorithm's efficiency for a proposed model to classify dark web data with optimization techniques. Text-based keywords from more than 1800 websites were collected by applying feature engineering techniques and the system's performance was evaluated with the SVM approach. The results are very encouraging as the Precision, Recall, and F-measure values are 0.83, 0.90 & 0.96 achieved with a dataset of 1800 URLs.
引用
收藏
页码:624 / 631
页数:8
相关论文
共 50 条
  • [21] Efficient text classification by weighted proximal SVM
    Zhuang, D
    Zhang, BY
    Yang, Q
    Yan, J
    Chen, Z
    Chen, Y
    Fifth IEEE International Conference on Data Mining, Proceedings, 2005, : 538 - 545
  • [22] SVM based adaptive learning method for text classification from positive and unlabeled documents
    Tao Peng
    Wanli Zuo
    Fengling He
    Knowledge and Information Systems, 2008, 16 : 281 - 301
  • [23] Text Classification Using SVM with Exponential Kernel
    Chen, Junting
    Zhong, Jian
    Xie, Yicai
    Cai, Caiyun
    COMPUTER AND INFORMATION TECHNOLOGY, 2014, 519-520 : 807 - +
  • [24] Augmented SVM with Ordinal Partitioning for Text Classification
    Shi, Yong
    Li, Peijia
    Niu, Lingfeng
    2017 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2017), 2017, : 959 - 962
  • [25] A New SVM Method for Short Text Classification Based on Semi-Supervised Learning
    Yin, Chunyong
    Xiang, Jun
    Zhang, Hui
    Wang, Jin
    Yin, Zhichao
    Kim, Jeong-Uk
    2015 4TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION TECHNOLOGY AND SENSOR APPLICATION (AITS), 2015, : 100 - 103
  • [26] A Generative Adversarial Learning Framework for Breaking Text-Based CAPTCHA in the Dark Web
    Zhang, Ning
    Ebrahimi, Mohammadreza
    Li, Weifeng
    Chen, Hsinchun
    2020 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS (ISI), 2020, : 169 - 174
  • [27] Web Potential Customer Classification Based on SVM
    Sun, Lei
    Duan, Zhu
    2012 INTERNATIONAL CONFERENCE ON INDUSTRIAL CONTROL AND ELECTRONICS ENGINEERING (ICICEE), 2012, : 568 - 570
  • [28] CLASSIFICATION OF POLSAR IMAGES BASED ON SVM WITH SELF-PACED LEARNING OPTIMIZATION
    Chen, Wenshuai
    Hai, Dong
    Gou, Shuiping
    Jiao, Licheng
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 4491 - 4494
  • [29] An improved SVM web page classification algorithm
    Ren, Xun-yi
    Shi, Chen
    Zhang, Dan
    Wang, Wen-si
    2018 INTERNATIONAL SYMPOSIUM ON POWER ELECTRONICS AND CONTROL ENGINEERING (ISPECE 2018), 2019, 1187
  • [30] Deep Learning-Based Text Classification to Improve Web Service Discovery
    Meghazi, Hadj Madani
    Mostefaoui, Sid Ahmed
    Maaskri, Moustafa
    Aklouf, Youcef
    COMPUTACION Y SISTEMAS, 2024, 28 (02): : 529 - 542