Dark Web Text Classification by Learning through SVM Optimization

Cited by: 5
Authors
Murty, Ch A. S. [1]
Rughani, Parag H. [2]
Affiliations
[1] Ctr Dev Adv Comp C DAC, Hyderabad, India
[2] Natl Forens Sci Univ, Digital Forens, Gandhinagar, Gujarat, India
Keywords
Darkweb; SVM; classification; Darkweb content classification;
DOI
10.12720/jait.13.6.624-631
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
The dark web has become the largest repository of unauthorized information, surpassing the surface web, because of the anonymity and privacy it offers. These same features are making the dark web a safe place for illegal activities, which in turn drives growth in dark web usage and in the volume of onion-based URLs. As the number of dark web users grows, cybercrime investigators across the globe need to classify dark web data in order to understand the various illegal activities and, with the help of feature engineering, to categorize and control the URLs hosting them. This research uses the Support Vector Machines (SVM) algorithm, combined with optimization techniques, to assess its efficiency in a proposed model for classifying dark web data. Text-based keywords were collected from more than 1800 websites by applying feature engineering techniques, and the system's performance was evaluated with the SVM approach. The results are encouraging: Precision, Recall, and F-measure values of 0.83, 0.90, and 0.96, respectively, were achieved with a dataset of 1800 URLs.
Pages: 624-631
Number of pages: 8
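
The abstract evaluates the classifier by Precision, Recall, and F-measure. As a minimal sketch of how such scores are usually computed for one target class, the example below counts true positives, false positives, and false negatives from predicted labels. It makes several assumptions not stated in this record: the labels "illicit" and "benign" and the toy predictions are hypothetical, the function name is illustrative, and the balanced F-measure (F1) is assumed because the abstract does not say which variant or averaging scheme the authors used.

```python
from collections import Counter


def precision_recall_f1(y_true, y_pred, positive):
    """Compute precision, recall, and F1 for one class treated as positive."""
    # Count each (true label, predicted label) pair once.
    pairs = Counter(zip(y_true, y_pred))
    tp = sum(n for (t, p), n in pairs.items() if t == positive and p == positive)
    fp = sum(n for (t, p), n in pairs.items() if t != positive and p == positive)
    fn = sum(n for (t, p), n in pairs.items() if t == positive and p != positive)

    # Guard against division by zero when a class is never predicted or never present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1 is the harmonic mean of precision and recall (assumed variant).
    if precision + recall:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0
    return precision, recall, f1


# Hypothetical labels for six pages; not data from the paper.
y_true = ["illicit", "illicit", "illicit", "benign", "benign", "illicit"]
y_pred = ["illicit", "illicit", "benign", "illicit", "benign", "illicit"]

precision, recall, f1 = precision_recall_f1(y_true, y_pred, positive="illicit")
print(f"precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```

For a multi-class setting such as categorizing pages by type of activity, these per-class scores would typically be averaged across classes. Which averaging the authors applied is not given here.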