Content-based text classiriers for pornographic web filtering

被引:9
|
作者
Polpinij, Jantima [1 ]
Chotthanom, Anirut [1 ]
Sibunruang, Chumsak [1 ]
Chamchong, Rapeepom [1 ]
Puangpronpitag, Somnuk [1 ]
机构
[1] Mahasarakham Univ, Fac Informat, Maha Sarakham 44150, Thailand
来源
2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS | 2006年
关键词
pornographic web filtering; text classification; Naive Bayes; support vector machines;
D O I
10.1109/ICSMC.2006.384926
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the flood of pornographic web sites on the internet, effective web filtering systems are essential. Web filtering based on content has become one of the important techniques to handle and filter inappropriate information on the web. We examine two machine learning algorithms (Support Vector Machines and Naive Bayes) for pornographic web filtering based on text content. We then focus initially on Thai-language and English-language web sites. In this paper, we aim to investigate whether machine learning algorithms are suitable for web sites classification. The empirical results show that the classifier based Support Vector Machines are more effective for pornographic web filtering than Naive Bayes classifier after testing, especially an effectiveness for the over-blocking problem.
引用
收藏
页码:1481 / +
页数:2
相关论文
共 50 条
  • [31] WebAngels filter:A violent Web filtering engine using textual and structural content-based analysis
    Guermazi, Radhouane
    Hammami, Mohamed
    Hamadou, Abdelmajid Ben
    ADVANCES IN DATA MINING, PROCEEDINGS: MEDICAL APPLICATIONS, E-COMMERCE, MARKETING, AND THEORETICAL ASPECTS, 2008, 5077 : 268 - +
  • [32] Hybrid collaborative filtering and content-based filtering for improved recommender system
    Jung, KY
    Park, DH
    Lee, JH
    COMPUTATIONAL SCIENCE - ICCS 2004, PT 1, PROCEEDINGS, 2004, 3036 : 295 - 302
  • [34] Text Classification Models for Web Content Filtering and Online Safety
    Liu, Shuhua
    Forss, Thomas
    2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2015, : 961 - 968
  • [35] Content-based methodology for anomaly detection on the web
    Last, M
    Shapira, B
    Elovici, Y
    Zaafrany, O
    Kandel, A
    ADVANCES IN WEB INTELLIGENCE, 2003, 2663 : 113 - 123
  • [36] Content-Based Sensor Search for the Web of Things
    Truong, Cuong
    Roemer, Kay
    2013 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2013, : 2654 - 2660
  • [37] Query optimization method based on automaton for content-based filtering
    Wang, Tong
    Liu, Daxin
    Lin, Xuanzuo
    20TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS, VOL 2, PROCEEDINGS, 2006, : 724 - +
  • [38] Content-based table retrieval for web queries
    Sun, Yibo
    Yan, Zhao
    Tang, Duyu
    Duan, Nan
    Qin, Bing
    NEUROCOMPUTING, 2019, 349 : 183 - 189
  • [39] Content-based filtering for music recommendation based on ubiquitous computing
    Kim, Jong-Hun
    Kang, Un-Gu
    Lee, Jung-Hyun
    INTELLIGENT INFORMATION PROCESSING III, 2006, 228 : 463 - +
  • [40] Content-Based Network Filtering of Encrypted Image Data
    Hamdi, Mohamed
    Boudriga, Noureddine
    2010 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND INFORMATION SECURITY (WCNIS), VOL 1, 2010, : 343 - 348